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AUSTkACT 


MisrcgistraUion is but one of a group of parameters (noise, class separability, 
spatial transient response, field sizes) affecting the accuracy of multispectral 
classifications. The entire group must be considered simultaneously. Any noise 
in the measurements (due to the scene, sensor, or to the analog/digital conversion) 
will cause a finite fraction of the measurements to fall outside of the classi- 
fication limits, even within nominally uniform fields. For field boundaries, where 
the effects of misregistration are felt, additional pixels will be misclassified 
due to the mixture of materials in the pixels. Misregistration causes field borders 
in a given (set of) band(s) to be closer than expected to a given pixel, causing 
additional pixels to be misclassified. Simplified models of the various effects 
are used to gain conceptual understanding and to estimate the performance to be 
expected. 
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SECTION 1 


exscutive summary 


INTRODUCTION 

Spectral analysis generally takes the torm ol multispectral 
classitication in which the classification is done by comparing the sample 

measurement vector to the statistics of the set oi known material vectors 
(training statistics) representing all possible claseesi and by using one of 
several decision methods, determining which o£ the knowns it most nearly 
mutches • 

The problem pursued will be the ettects ot misregistration on the 
accuracy ot multispectral classification in answer to the questions 

Whaf ar >2 the effects on multispectral classification accuracy of 
relaxing the overall scene registration accuracy from 0.3 to 0.5 
pixel? 

The misregistration is but one ot a group ot parameters (noise, class 

separability, spatial transient response, field size) which must all be 
considered simultaneously. The thread of the argument (which will be 
discussed in detail below) is this: any noise in the measurements (due to the 

scene, sensor, or the analog to digital process) causes a finite fraction ol 

measurements to fall outside of the classification limits. For field 
boundaries, where the misregistration effects are felt, the misregistration 
causes the border in a given (set of) band(s) to be closer than expected to a 
given pixel, so that the mixed materials in the pixels causes adaitional 
pixels to fall outside ot the class limits. Considerations of the transient 
distance involved in the difference in brightness between adjacent fielus, 
when scaled to "per pixel", allows the estimation of the width of the border 
zones. The entire problem is then scaled to field sizes to allow estimation 
of the global effects. 

This approach allows the estimation of the accuracy ot multispectral 
classification which might be expected for field interiors, the- useful number 
of quantization bits, and one set of criteria for an unbiased classifier. 
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COMCLUb'IONS 


The toUowing brieCly BCated concluBione are developed in detail in the 
body ol the report. 

• The dilterc'f’ce between U.3 ana 0.5 pixel iniBregietration 
IB in the >»* ise tor multi spectral clasBitication. 

• Piecieion users way have to reregister image segments any- 
way, making extreme registration precision by the system 
ot lesB inipaitance. 

• Interpolation algorithm choice is relatively unimportant, 
provided a higher order interpolator is used. 

• It small tielus aie important, small pixels are more important 
than sensor noise contributions. 

In adaxLion, several observations result: 

• System registration to 1-2 pixels should satisty users 
of iilra products. 

• There is a grey area of 0.5 to 1-2 pixels in which the 
requirements tor high precision are not well justified. 


THE IIASIC MODEL 

The expected effect ot misclassitication may be estimateci by a simple 
tirst-oruer approach, because th« differences in classification accuracy 
between the many classification schemes and conditions that have been tested 
are overshadowed by the vagaries in the data and assumptions in the 
classification process, so that higher order analysis will contribute little 
additional undf’cstanding. 


Consider first the probability ol correct identification ot a field 
interior pixel. Field interiors are nonunitonu because of the combined 
effects of sensor noise, scaled to equivalent reflectivity (NEAp ) and 
inherent nonunitoniuties in the field itself. The overall brightness 
distribution is considered to be Gaussian - this is approximately true for 
field interiors, although the distribution deviates considerably toward 
bimodal tor mixed materials at fiela boroers. 

The combined effect ot these various noise sources produces a finite 
probability of misclassification. (Figure S-1) The first-order estimate 
considers the total variance caused by the scene, sensor and quantization as 
compared to the defined class size limits, however these are determined. 
Similar, but relatively second-order, effect may be expected with a higher 
order analysis, proper classifier training, resulting in accurate limits, is 
essential (Hixson et al, I9b0) . 

For simplicity, and because of the later desire to misregister one (or 
more) ot the bands, the discussion will assume that spectral bands as sensed 
will be used, and that Cor recognition, the unknown pixel must fall between 



appropriate Ximiti in every band tested. Thereiore, brightxiesi outside oi a 
limit in any one band is euificient for rejection, so that we need to consider 
only one band at a time. 

The probability oi a sample being witnin the class limits can be derived 
by assuming that an ensemble oi clean signals irom a series oi areas oi the 
same material can be anywhere within the quantizing range with uniform 
probability, but that individual samples are perturbed by the Gaussian noise 
with a distribution equal to cr . The probability distribution oi the signal 
plus noise is iound by convolving the probability distribution of the signal 
with that oi the noise. The probability oi correct class assignment Ci.e., 
the pixel is within the class limits) is then iound by integrating the 
probability distribution between appropriate class limits IFriedman 1905). 
The result oi this calculation is shown in Figure b-2. In the useful range 
ot 6 (3<6<7), the curve can be approximfted by 


(Hog P « - 0.40 

where P » probability ot correct classilitation, and 

g - class size ^ with class size and a in the same units. 

® scene scene 

Sources ot noise will be the scene itsell and tne sensoi, both assumed to 
be random for this analysis. The root mean square (nns) sum is taken to give 
the total eilective noise, A number of pixel measurements may be averaged 
together to reduce the noise belore classification. This final noi,-.e figure 

may be compared to the width of the class to give 3 > frwm which the 
probability P of correct classification may be estimated. This leads to the 
Classification Error Fstimator, Fig. S'-S. 

As an example, consider a scene having a field-interior variation of 3%, 
to be viewed witli a sensor having a total noise figure of 1%, The total 

effective noise seen by the classifier (upper left) will be the rms sum ot 

these, or 3.16%, which tor a total 0-255 digital number (dn) range, Vvould he 
8.1 dn. It the class width (determined by the classifier algorithm) is 25 dn 
(right center) the $ “ 3.1, giving P » 0.742 (right lower). If this P is 
not accurate enought for the analysis, several pixels must be averagea(right 
upper): a 2x2 averaging wivl raise 3 to 6.2, giving a new P * 0.86. 

Considering 3 in this way allows an estimation of the total noise 
permissible as it affects the attainable classification accuracy. It the 

amount of scene noise to be encountered in a given classification task can be 
estimated, the allowable extra noise from the sensor and quantization can be 
specified by estimating the loss ot accuracy of the classification caused by 
quantization error. This leads to an estimate of the number ot bits which 
will be useful. 

Define the perfect sensor as having no random noise nor quantization error 
(i.e, , an infinite number of bits). Th;* s will define (tor nxn pixels averaged) 

g - class size . n and '^o " 10“®* ^^^o ^ 

” o scene 
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For the real, 


sensor, 6<S, because o£ 


the 


Finite 


ana quantization' 


The new probability of correct classification P is related to Pq by: 


A plot 
S-4, for 
allocation 


P . P <e/s„) 

0 

of the loss in classification accuracy vs. Pq is given in 
the parameter families 0o/6 and 
starts with aefining of the desired P 


sensor/ cr scene* 


Figure 

Noise 


and 


Detinition 

a 


of the 


ascertaining that the 
allowed AP determines (e.g., 
An estimation of the scene 


required B^can be obtainea. 

from the graph) the allowed ^sensor/ O’ scene 
noise for which the other conditions apply allows the calculation of the total 
sensor noise allowed. The final step is to partition this noise between 
sensor random noise and quantization noise. 


For example, let the desired P^ * 85^ and allow no more than 2^ loss 
due to the total sensor noise. Ihe no-sensor-noise 6o tiust oe ^5.7 to give 
Pq. Then, from Figure S-4, the allowed ^sensor “ *^*^ ^ ^ scene* 
the scene has a O' gcene “ allowable O * 0.6x2% “ 

1.2%, which must be partitioned between NEAp and the q uantization noise. 
B’or NEAp “ 1%, the allowable o q^g^t " y 1-2^ - l^ » 0.66%, which 
can be met by 6-bit quantization. 

Two observations are important here: (1) Increasing the number of bits of 

quantization produces improvements which asymptotically approach zero, as each 
successive bit reduces the step size by a factor of 1/2. (2) A scene having 

as little as 2% variation is a very uniform scene. Since this noise is rms'd 
with the sensor noise, it will overwhelm any but a very noisy senior. 
Therefore, tor purposes of multispec cral classification, more than six bits 
would seem to oe unnecessary. 


EDGE EFFECTS 


To this point, the analysis is based on pixels well inside uniform fields 
and well away from field boundaries. A number of experimenters have spent 

appreciable time discovering that classification accuracy falls oft at 

boundaries due to what has become known as the mixed-pixel effect. We will 
start at that point and attempt to model the effect to allow us to quantity 
Our expectations. 

We assume as a starting point that all the spectral bands used in 

classification, whether obtained from one date or series of dates, are in 
perfect registration. This means Chat when the pixel grids from each band are 
aligned the aata contents (tield borders, roads, all features) are also 

aligned - note tnat this is more than simply having all internal distortions 
removed, wnich is all that most geometric rectifications accomplish. 
Misregistration will (later) be considered as the lack of alignment of the 
pixel grids; because the computer can only work with pixel grids, aligning 
these pixel grids appears to the computer as a shift in the boundaries. We 
will assume that training samples are accurate and that class limits have been 
set from these by the classifier chosen. The classification is modelled as 

follows: signature shitting in any individual band will tend to cause 

misc lassification, so that the situation may be treated one band at a 
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time. The etfects ot pxxel mixture in all banas may then be rms*o together if 
desirea. The entire analysis simplifies to the consiaeration of the transient 
intensity shift across field boundaries as compared to the class limits and 
the noise components of the measurement. 

The first step in analyzing the spatial extent of pi>. >4. mixing across 
borders :s to estimate the shape ana extent of the transient intensity shift. 
If the impulse response functions or the modulation transfer functions IhTFs) 
of the various components (and, hence, the entire system) are known, a precise 
transient response may be calculated. For example, the specifications for the 
Thematic Mapper tor Landsat D call for a 2% to ')%% time equivalent of about 2 
pixels implying a l0%-90% transient response of about 1.3 pixel. The 
practical result of this is that the "infinitely sharp" edges of the real 
scene will be softened by the filtering effect of the scanning aperture 
(assumed to be rectangular and having uniform response) and it is this 
softened transient response which is sampled. Interpolation required for 
registration will cause some further softening, and the use ot any of the 
competent higher-order interpolation functions (ainx/x, TRU cubic convolution, 
modified cubic convolution, other splines) will have minor effects of the rise 
tiii'e. A total '^10-90 (transient response from 10% to 90%) of 1.5 pixels 
with no ringing will be used as a surrogate global value. 

The transient situation across a border is sketched in Fig. S-5. We are 
concerned here with the decrease in probability that a given pixel will have a 
value within the class limits as that pixel moves toward the boundary, as 
shown in Figure S-6. The analysis only needs to aetermine the area unaer the 
normal curve (assuming the noise is Gaussian) between the limits as determined 
by the classification class size and the offset from the "field interior value" 
caused by the mixture. The important scaling involved is the amount ot signal 
shift caused by the transient total shitt T, as related to tne aesirea class 
size S, for a given 3* The left portion of Figure S-7 reflects this shift in 
brightness (vertical axis) as it affects the area within the class (the 
probability of recognition) . 

The transient rise distance estimated for the Thematic Mapper has very 
close to a Gaussian shape and a '^10-90 “ pixel. The amount of 
brightness shift is the difference between the brightness of the field under 
consideration and the adjacent field which is causing the shift. The 
important intensity relation is the magnitude of this shitt, T, as related to 
the size S of the class being tested by the ratio T/b. These curves, for 
various T/S , are combined with the probability curves of the previous 
discussion in Figure S-7. From this may be estimated the loss in probability 
in classification of pixels near borders. 


BIAS IN FIELD SIZE ESTIMATION 

It can be appreciated that several things are happening simultaneously: 
If the lower limit of fiela B and the upper limit of field A have a gap 
between, pixels "lost" by field B will not be picked up by field A, and will 
be considered unknowns and not be counted in either field. The lost pixels 
will be some interior pixels, cue to insufficient 3 , end a large number ot 
near-border pixels, resulting in apparent field size loss. Only if the lower 
limit of field B and the upper limit of fiela A are coincident will 
pixels lost from one tield be picked up by the other, and vice versa, to give 
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complete account of all pixels, for Che tield size estimator to be unbiased, 
the loss nd-pickup in both directions must cancel; Chat is, on Che average 
Che Crue oorder must be located. The total effect will depend on Che ratio of 
the number of border pixels to the number of field-interior pixels, and hence 
is a function of the field shape and size. 

This leads directly to the required algorithm for tield size estimation: 
First divide the scene into blobs, each of which is sufficiently uniform, ana 
with closed boundaries. Then tor each blob (field) determine the average 
brightness for all the interior pixels which are safely away from the border. 
For each segment of the border, the correct field edge decision level is 
midway (in o's) between the average brightness of the two fields on either 
side. After the borders are located using this criterion, tne tield interiors 
may be reclassified using Che classification limits as determinea from the 
training samples. 


EFFECTS OF MISREGISTRATION 

In preparation for estimation of Che misregistration effects, an analysis 
will first be made of the expectations of registered data and the sensitivity 
to tne various parameters estimated. The starting model used has rectangular 
tielas aligned with the pixel grid. Pixels are grouped into four zones: 1) 

Interior (i)-those with centers 2 or more pixels inside borders, 2) Inner 
border (ib)-pixels with centers 1-1/2 pixel inside borders, 3) Outer border 
(ob)-pixels with centers 1/2 pixel insioe borders, 4) Exterior border 
(xb)-pixels outsiae Che borders, with centers 1/2 pixel outside. Estimates of 
classification accuracy for each zone are obtained from Figure S-7 . The total 
estimate of classification accuracy is the sum of pixels in each zone 
multiplied by the corresponding zone accuracy estimate. Later, the field will 
be mis registered , changes in the number of pixels in each zone calculated, and 
the probabilities again summed. The following parameters are required: 

r - the field shape ratio, length of long siae/length ot short side 

T - transient brightness difference between tield being considered 
and its neighbor 

5 - decision class size 

T - transient distance for 10% to 90% response 

3 - class size S/a of Gaussian noise 

The tollowing global values selected for the parameters are considered to be 
representative : 

r =2 

T/S “ 1 to 5 

T = 1.5 pixels 

6 = 3 to 5 

After the parameters r, T/S, T , and 6 are selected, the resultant (from 
Fig. b-7) probabilities are substituted for the brightnesses in the various 
zones Co produce a "probability image" aligned with the desired output pixel 
grid. The probability assigned to a pixel at a given location represents the 
probability that that pixel vjill have a brightness falling within the 
classification limit determined by the classifier, for the given spectral 
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banu. Ihe total probability ot correct classitication is given by 


1 

Cll? 


‘’l"i + P 


lb"lb 


ob*^ob 


+ Pvi,n 


xb xb 


where nj^ is tfte field wietli (short side) in pixels, ana n^, n^jj, n0tj, 
are the number of pixels in the various zones. Using these values, the 
global estimate ot the probability ot correct c lassitication with no 
misregistration is given Figure b-b for three values of T/S. The predominant 
effect IS the pixel mixture (the eftect ot T/b). As expected, clus is worst 
tor small fields (n^ small) because ot the larger percentage of border 
pixels tor these fields. Note that tor T/S “ 1, decision level midway between 
brightnesses of adjacent fields, no probability loss occurs, even with small 
fields. Unfortunately, this desirable condition cannot be systematically 
obtained . 


MISREGISTRATION OF CONGRUENT FIELDS 

The initial model for misregistration is a displacement of d pixels, equal 
in both X ana y. The result of this misregistration is that some area is lost 
from the external border, causing a further classification accuracy decrease. 
The misregistration loss as seen by the external border loss is given by 

Ai- - Prt [■* ^ 

The basic character of this misregistration loss term is l/n^, so tnat it 
will nave a slope approximately equal to -I on a log-log plot vs n^. The 
precise results depend critically on tne values of p^g estimated for Che 
Pjjg from Figure b-7: 


T/S 

fi 

T = 1 

1 - 1.5 

[ » 2 


3 

.10 

.14 

.20 

1 

5 

.02 

.025 

.07 


7 

0 

.01 

.04 


3 

0 

0 

0 

2 

5 

0 

0 

0 


7 

0 

0 

0 


Using these values, Che loss AP due Co displacement misregistration is 
plotted in Figure S-9 for various parameter combinations. 


MISREGISTRATION DUE TO NONCONGRUENCE 
1. SIZE AND RATIO (ASPECT) CHANGES 

Size and aspect ratio changes can come about from several causes such as 
scan velocity or altitude changes, ana if uncompensated can cause additional 
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misregistration errors. Progressive misregistration from a point of accurate 
registration will be caused by both causes (Figure S-lOa); the modeling of 
this effect considers first that size changes N ■ n'/n will cause a shift in 
points n to points n' both vertically and horizontally, and then that changes 
in aspect ratio will cause further shifts in the horizontal position of 
vertical borders by changing the field shape ratios by the factor R * r'/r. 
The resulting shifts are: 


A tty ® (N - l) Oy and Anj, * (NR - 1) rny 


For analysis, this shift will be divided around the borders symmetrically as 
optimum field registration is accomplished (Figure S-lOb). Two cases must be 
distinguished (using scan velocity as a surrogate cause): 

Case 1: A slow scan decreases pixel spacing and puts more pixels into a 
given field. When these are placed into the output grid, the field appears 
stretched. The field as defined by the other (correct) bands now covers only 
part of the stretched field, so that the classification tends to see only 
interior pixels, and the accuracy will increase, ultimately reaching Che 
field-interior accuracy. The sizes of the border errors are; 

1 , 1 , 

®l “ 2 - 1) n^ and ^'^1 


Case II: A fast scan has the opposite effect, causing the field to appear 
smaller and the analysis pixels defined by the other bands now include more 
exterior pixels. The classification accuracy will decrease. 


For fast scan, the smaller apparent field covers an area expressed as a 
fraction f^^ of the total: 



2 

RN 


Fractional Areas: 


xb 


2Nm -f 2NRmr -f 4 


rn 


1 


(Interior) 


(External Border) 


The total expected probability is 


^tot “ ^iPi ^xb Pxb 

Since the external border pixels are now included within the analyzed 
field, but with a low probability, the fractional area RN^ represents 
approximately the fraction of the basic field-interior accuracy to be 
expected. Since the total size shrinkage (in pixels) is small for small n^, 
only larger n^ need be considered, and the l/n^ term may be dropped. 
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This allows Ptot approximated for r » 2 by : 

Ptot ® RN^Pi + -^Pxb 
"l 

For large fields, the probability is seen to be indepenaent of field size, and 
only weakly aependent (because of low for small sizes. 

2. WAVY BORUEKS AND MULTIPLE ACQUISTIONS 

For single-band analysis, with borders aistorted so that there are pixels 
both insiae and outside of the analyzed area, some pixels will have increased 
probabilities of correct classification and some will have less. The oecrease 
in probability across border is (very) approximately linear, so that the 
(signed) average displacement will model the effect. 

For multiband analysis, those pixels having a low probability of 
classification will have the largest effect as the net probability at each 
pixel location is the proauct of the probabilities obtained for each 
acquisition (band). In this case the rms displacement will produce a better 
model of the effects. 


SOME OBSERVATIONS 

1. ON BASIC CLASSIFICATION 

• The total noise figure (compared to the class size in a given 

determination) controls g , and in turn controls the maximum 
attainable classification accuracy. However, for practical range of 
3 < 8 < 7, increasing 8 has only a moderate effect. 

• Because of this, if small fields are most important, the reflected 

energy might more profitably be divided into smaller pixels, even at 
the expense of NEAp . As this will cause an increase in data rate, 

optimum coding should be investigated. The possible noise introduced 

in reconstructing the aata will cause some further decrease in the 
overall effective NEAP and so decreases 3 . But since there is 
smaller sensitivity to 3 than to l/n^, there should be a net gain 
in utility . 

• Increasing the number of bits of quantization produces improvements 
which asymptotically approach zero, as each successive bit reduces 
the step size by a factor of 1/2. 

• A scene having as little as 2% variation is a very uniform scene. 

Since this noise is rms’d witn the sensor noise, it will overwhelm 

any but a very noisy sensor. Therefore, for purposes of 
multispectral classification, an extreme number of bits would seem to 
be unnecessary . 

2. ON EDGE EFFECTS 

• For accurate field size estimation, the decision brightness must be 
halfway between the brightnesses of the fields on either side of a 
given boundary. This means tnat classifiers set for material 
identification will in general produce errors in field size. But Che 


1-9 


field-interior brightness is increasingly hard to estimate for small 
fields because of the fewer interior pixels. 

• It is important to keep the transient response distance and the 
accompanying sample spacing small, to get as many pixels into a given 
ground distance as possible. Field area errors become large at nj^ 
“ 5 or less. The transient distance must also be matched between 
spectral bands. 

• At the resolution expected for the Thematic ^Ulpper, the atmospheric 
point spread function may become more r’ominant than the Thematic 
Mapper point spread function. If this is determined to be true, the 
registration requirements may be relaxed since scene-dependent 
registration will be required anyway. 


3. ON MISREGISTRATION 

• For large T/S (i.e., 2 or more) the edge effects are so great that 
the base probability is drastically affected, and the external border 
pixels have zero probability of being within the class limits. For 
this reason, there is no misregistration effect for large T/S. 

• Square fields show the moat misregistration loss, when scaled to 

ni. 

• A shape ratio r*2 is believed to be representative. 

• Misregistration loss decreases with higher 3 • However, these losses 

in general are small to begin with, and the discussion calling for 
sacrifice of 3 to gain smaller IFOV (more pixels n^ into a given 

tield) would seem to override. 

• Increase in t decreases the basic accuracy of edge pixels and also 

increases the misregistration losses. 

• Geometric rectification and registration procedures must not only 

remove the internal distortions but must also produce pixels on a 
defined (preferably ground-referenced) grid. Current procedures do 
not do this. Without this rererence grid, users will have to 
re interpolate before multi-temporal data can be compared. 

• Scale and aspect ratio errors will have only minor effects on 
moderate-area problems. But they will cause problems in correlating 
over large distances. 

• Altitude relief displacement will require users to use many control 
points to register images in areas of high relief. 

• Unless standard reference grias are established, users requiring 
registration will have to interpolate every image, even in low relief 
areas. 

• For single-band analysis, the algebraic average of the displacement 
may be used. For multiband analysis, with erratic errors in location 
among the bands, the lowest probability of correct classification 
holds and the rms of the displacements is appropriate. 
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AN UNANSWERED QUESTION 


This reporc models Che potencial roisregistraCion ettecCs on raultispectral 
classiiicacion accuracy. XC may allow che comparison ol che various cescs and 
simulations, and points out the variables which must be reported tor those 
simulations to allow their validation. It does not answer the following 
question: Given a certain loss in accuracy due to misregistration, how does 

that damage che ability to use the data analysis results? These evaluations 
will be discipline dependent, and must be sought separately. 
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i-1 Friedman, H.D. , On the Expected Error in the Probability of 
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Figure .S-1 Effect of Noise on the Probability of Correct Multi- 
spectral Classification 



^ = CUSS SIZE/rms NOiSE 

Figure S-2 Given a Signal Uniformly Probable over the Dynamic Range, and 
Gaussian Noise with standard Deviation ■ a . The curve shows 
the Probability of Correctly Recognizing a class corresponding 
to the Noise-free Signal as a Function of the Ratio 
<3 ■ class size / a . 
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PiXEL SPACING 


Figure S-5 Cross Ssction of brightness trees ecross e boundary between 
two fields# showing the distence required for the bright- 
ness trensition. 
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Figure S-6 the distribution of "field” pirels noves down the trensition 
curve es the neasureaent point noves toward the boundary. 

The shaded area is the proportion which will be correctly 
classified. 
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PIXEL CENTER DISTANCE FROM BORDER - PIXELS 

Figure S-7 Combined Curves for Translation of Pixel Distance from 
Border to Shift in the Brlphtness (Scaled In Units of Class 
Sire, S ) to Probability of Correct Classification. 
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Figure S-8 Probability of Correct Classification using Global 
Parameters, for Perfectly Registered Pixels. 

One Sne.ctral Band Only. 





a. GENERAL CASE b, BEST REGISTRATION CASE 

(NO TRANSLATION DISPLACEMENT) 


Figure R-10 Construction for Estimating Misregistration Caused 
By Sire and Aspect F.rrors. 
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Figure S-9 Loss of Classification 
Accuracy due to Misregistration of 
One Band, for Various Parameter Com- 
binations. 

3 ■ Class slze/o of noise 
r • Field Shape Ratio, long/ahort sides 
T ■ 10-90% transient distance 
Oj ■ length of short side, pixels 
d ■ displacement, pixels 
AP » loss in probability 


I 
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SECTION II 


DETAILED REPORT 


INTRODUCTION 

Spectral analysis generally takes the term o£ multispectral classification 
in which the classification is done by comparing the sample measurement vector 
to the statistics of the set of known material vectors (training statistics) 
representing all possible classes, and by using one of several decision 
methods, determining which of the knowns it most nearly matches. 
Alternatively, pixel clusters are determined in spectral space, after which 
the materials forming each cluster are identified. As these have been treated 
extensively in the Literature, the details will not be pursued here. Rather, 
this stuay will consider some perturbing effects from a generic point of view, 
which may allow us to judge the effects of various sources of error, ana which 
will provide models against which empirical studies may be evaluated. 

The problem pursued will be the effects of misregistration on the accuracy 
of multispectral classification in anwser to the question: 

Wliat are the effects on multispectral classification accuracy of 

relaxing the overall scene registration accuracy from 0.3 to U.5 

pixel? 

This is a problem of particular importance because of the prevalance of 
analyses requiring the use of data from more than one acquisition. Here, 
acquisition includes several spectral bands, requiring bana-to-band 
registration, or the combination of several data sets from diverse sources, 
such as combining data from the multispectral scanner (tiSS) and the Thematic 
Mapper (TM), the combination .if data from both space and non-space sources, or 
short or long range temporal studies requiring overlay of temporally related 
data. These are discussed in some detail in Bryant, 1981. 

Misregistration is but one of a group of parameters (noise, class 
separability, spatial transient response, field sizes) which must all be 
considered simultaneously. The thread of the argument (which is aiscussea in 
detail below) is this: any noise in the measurements (due to the scene, 

sensor, or the analog to digital process) causes a finite fraction of 

measurements to fall outside of the classification limits. For field 

boundaries, where the misregistration effects are felt, the misregistration 
causes the boraer in a given (set of) band(s) to be closer chan expected to a 
given pixel, so Chat the mixed pixel effect causes aadicional pixels to fall 
outside of the class limits. Considerations of the transient aiscance 

involved in the aifference in brightness between adjacent fields, when scalea 
to "per pixel", allows the estimation of the width of the border zones. The 
entire problem is then scaled to field sizes to allow estimation of the global 
erfects . 
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Ihis approach allows the estimation of the accuracy of multispectrai 
classification which might be expected for field interiors, the useful number 
of quantization bits, and one set of criteria tor an unbiased classifier. 


THE BASIC MODEL 

The expected effect of misclassif ication may be estimated by a simple 
tirst-oraer approach, because the differences in classification accuracy 
between the many classification schemes ana conditions that have been tested 
are oversbaoowed by the vagaries and assumptions in the classification 
process, so that higher oraer analysis will contribute little additional 
understanding. Further, it is assumed that the training statistics have been 
correctly determined from known materials, that the decision rules have fixed 
the decision boundaries in multispectrai space, and chat the IFOV (the basic 
resolution of the system) is fixed. The assumption of correctness of the 
decision limits as sec by the classifier from the training samples is crucial; 
it has been found (Hixson et al, I960) that under some conditions accuracy of 
the training is relatively more important than selection of the classification 
algorithm. This analysis concerns single pixels only, without taking 
advantage of the possible increase in accuracy obtainable by considering Che 
neighboring pixels or che advantages obtained by multiteraporal analysis. 

Consider first the probability of correct identification of a field 
interior pixel. Field interiors are nonunifom because of the combined 
effects of sensor noise, scaled to equivalent reflectivity (NEAp) ana inherent 
nonuniformities in the field itself. As these are restricted to approximately 
the same bandwidth, Che combined effect will be the root mean square (rms) 
sum. In addition, quantization noise is normally combined with the ocher 
noises to form the total noise figure. The overall brightness discribution 
will be considered to be Gaussian - this is approximately true tor field 
interiors, although Che distribution deviates considerably toward bimodal for 
mixed materials at field borders. 

The combined effect of these various noise sources produces a finite 
probability of misclassification. Precision of Che classification is greatest 
when Che known classes have tight statistics and are relatively widely 
separated in vector space. Addition of noise will cause che class statistics 
Co spread, so that Che separaCion/spread ratio decreases, and also causes 
uncertainty in the vector position of the unknown being classified. Ready et 
al C1971) have investigated this effect, and show in their Figure 12 
(reproduced here as Fig. 1) an experimental decrease in classification 
accuracy with increasing noise. 

The first-order estimate considers che total variance caused by the scene, 
sensor and quantization as compared Co the defined class size limits, however 
these are determined. Similar, but relatively second -order , effects may be 
expected with a higher order analysis. 

For simplicity, ana because of Che later desire to misregister one (or 
more) of the bands, Che discussion will assume that spectral bands as sensed 
will be used, and that for recognition, the unknown pixel must tall between 
appropriate limits in every band tested. Therefore, brightness outside of a 
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Limit in i»ny one band is sutticienc tor rejection, po tnau we need to consider 
only one band at a time. The resultant tirst-oroer multispec tral probability 
ot correct classification of a given pixel is the product ot the probabilities 
taKen one band at a time. 

This situation is illustrated in Figure 2, wnich shows the location in 
two-dimension decision space of tour typical materials, Decision boundaries 
are assumed to have bean determined from tne training samples for the 
materials according to some decision rule: Figure 2a illustrates boundaries 

from a maximum liklihood classifier, Figure 2b illustrates manually set 
boundaries ot a parallelepiped classifier as determined after consideration of 
the class spread and the tradeoff between the consequences of omission vs, 
comission. Figure 2c shows boundaries which closely surround the training 
cluster, as these might be used in ELLTAb or the Image 100. The analysis 
model is: Given a group of pixels having a brightness B 2 ^ distribution 

ot brightness in space, what is the liklihood that an individual pixel 
will have a brightness u. which tails outside the class limits in that 
band? The class limits in Xi are determined from the classification scheme 
used, tor the given brightness 82 , as sketched in Figure 2d. (Later, the 
brightness of tne center of this distribution in will be allowed to vary 
as misregistration causes mixed materials in this group ot pixels.) 

The quantized brightness level of a single pixel in a given spectral band 
has a finite probability of not being exactly at Che grand average brightness 
ot the field of which it is a sample (See Fig. 3). The probability ot a 
sample being within the class limits can be derived by assuming that an 
ensemble of clean signals from a series ot areas of the same material can be 
anywhere within the quantizing range with uniform probaoility, but individual 
samples are perturbed by the Gaussian noise with a distribution equal to (T . 
The probability distribution ot Che signal plus noise is found by convolving 
Che probability distribution ot the signal with chat ot Che noise. The 
probability ot correct class assignment (i.e., the pixel is within tne class 
limits) is Chen tound by integrating tne probability distribution between 
appropriate class limits. Friedman I I9b5 j . 

The probability of correct classification P thus calculated is given by: 

1 > » Gi-f - -iJX (i _ exp(-%B^)) 

^2 ^ 


where 


B =* class size/a , with class size and a in the same units 


The resulting curve is given in Fig. 4, which may be approximated tor ease in 
further analysis in the range Kp<7 by 


B log P * -0.40 
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As aLraady noceu, sources of noise will be the scene itseli: and ehe 
sensor, noth assumed to be ranoom tor this analysis. The rtits sum is taken to 
give Che total effective noise. A number of pixel measurements may be 
averaged together to reduce tne noise before classification. This final noise 
figure may be compared to the width of the class to give B .from which the 
probability ot correct classitication may be estimated. This leads to the 
Classification Error Estimator, Fig. 5. The mathematical form at this graph 
is: 


where 


B 


m 


class size S 
total variation 


n 


■ scene noise 
O 2 sensor random noise 
0'3 » sensor quantization noise 
n X n pixels are averaged together 


In this figure, the upper left graph represen ts the rms comoination of the 
scene noise or with the total sensor noise it *• / ct ^ + it| , the result being 
scaled to equivalent 8-bit digital number (dn) . The upper right graph 
represents the averaging ot n x n pixels to reduce the effective noise. The 
resultant total effective noise *^eff compared with the established 

class size Icenter, right) to give B » after which the curve from Figure 4 
(lower, right) converts this resultant g to the probability of correct 
classification, P. 

Consider a scene having a field interior variation of 3% as sensed by a sensor 
with a total noise of U. The total effective noise as seen by the classifier 
will be 

V 2 ? 

scene ’*'”^sensor “ 3.1fa« = 8.1 dn for full scale = 25b. 

Suppose that the class size tor this classification is 25 dn. With no pixel 
averaging, B =3.09, and the probability of correct classification is 

Blog P = -0.4/3.09 giving P « 0.742. 

Given that 74% is not accurate enougn, and that the noise and class size 
have been fixed by the scene and the classifier, pixel averaging ot (say) 2x2 
pixels will raise B fo b.l8, which in turn will increase P to 86%. 


The useful range of B seems to be from 3 on the low end, limited by 
acceptability of tne classiticacion accuracy obtained, to about 7 on che high 
end, limited by the scene and/or sensor noise. 

Considering B in tnis way allows an estimation of the total noise 
permissable as it affects tne attainable classification accuracy. If the 
amount ot scene noise to be encountered in a given classitication task can be 
estimated, the allowable extra noise from the sensor ano quantization can be 
specified. This leads to an estimate of che number ot bits which will be 
useful to be transmitted. This is developed in Appendix A. For the current 
discussion, noise will be considered to be che total effect from all sources. 
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EDGE EFFECTS 

To this point, the analysis is based on pixels well inside uniform fielas 
and well away from field boundaries. A number of experimenters have spent 
appreciable time discovering that classification accuracy falls off at 
boundaries due to what has become known as the mixed-pixel effect. We will 
start at that point and attempt to model the effect to allow us to quantity 
our expectations. 

We assume as a starting point that all the spectral bands used in a 
classification* whether obtained from one date or a series of dates, are in 
perfect registration. This means that when the pixel grids from each band are 
aligned the data contents (field borders, roads, all features) are also 
aligned - note Chat this is more than simply having all internal distortions 
removed, which is all that most geometric rectifications accomplish. 
Misregistration will (later) be considered as the lack of alignment of the 
pixel grids; since the computer can only work with pixel grids, this amounts 
to aligning these pixel grids. This will appear to the computer cs a shift in 
Che boundaries. We assume as before that training samples are accurate and 
that class limits have been set from these by Che classifier chosen. The 
classification is modeled as follows: signature shifting in any individual 

band will Cetivi to cause misclassification, so that we may treat Che situation 
one band at a time. The effects of pixel mixture in all bancs may then be 
combined together if desired, The entire analysis simplifies to the 
consideration of the transient intensity shift across field boundaries as 
Compared to the class limits and the noise components of the measurenient . 

For the case of one band being misaligned with respect to the rest, the 
resultant brightness shift due to the pixel mixture is along the spectral axis 
involved. The amount and direction of shift will depend on die brightness of 
the new material being mixed with the original, in that band. Figure 6 
illustrates this tor the case of a shift in physical position 
(misregistration) of band considering Chat the true field boundaries are 

correctly defined in band 1 2’ general case, the most correct 

position of the boundaries must first be defined, and the mislocation of 
boundaries in each of the spectral bands considered separately. The resultant 
spectral shift, no longer along any one spectral axis, may be broken into 
orthog.;,‘? components and the bands treated separately. For the case of 
interna; warping such as may occur for the Tliematic Mapper, rather than a 
simple lateral shift, some rras average of the amount ot pixel shifting must be 
estimated, after which an analysis along the lines herein may be applied. 

Tlie first step in analyzing the spatial extent of pixel mixing across 
borders is to estimate the shape and extent ot the transient intensity shift. 
If the impulse response functions or the MTFs of the various components (and, 
hence, the entire system) are known, a precise transient response may bo 
calculated. For example, the specifications for the Thematic Mapper for 

Landsat D call for a 2/4 to 98Z time equivalent to about 2 pixwls, implying a 
10-90^ transient response of about 1+ pixel (Appendix B) . If the true system 
impulse or frequency response is not known, the transient response must be 
estimated. For a band limited system Ax, the maximum allowed distance 

between samples, is given by tlie Nyquist criterion as Ax = l/(2fo), where 
1 q is the spatial frequency above which there is no signal content (Figure 
7). This is approximately the sampling condition of the Multispectral Scanner 
ana of the Thematic Mapper. Most practical systems will have a roll-off in 
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irequtancy response tor whicli an approximate transient rule ot thumb is 
t iQ-yg » l/(3t'jj), where '^10-90 oistance between the 10% and 

90/. response points and £'q is the nominal cutolf trequency at the 70% 

trequency response point (Figure 7). The shape ot the pass band .till be 
approximately Gaussian, scaled to the principal determining factor, the 
scanning aperture size. Uetining the resultant passband limit tg as the 

tirst zero point ot the sine x uetined by the scanning aperture at x * , the 

70% response point is at about x * 1.4 . Thus f 'g/fg^l. 4/ ■rr . 

bubstituting 1/2 Ax tor tg, and putting the resulting equation tor T gives 


1 

■^ 10-90 “ 


2 


Ax 


K 


1.5AX . 


While It is realized that tnere are several approximations in the above, so 
are there in tne usual basic assumption that data are Gaussian in distribution 
- both sets of assumptions are generally correct, and more precise 
mathematical formulation would not clarity tne issues. 

The practical result of this are: 

• The "intinitely sharp" edges (narrow roads, point sources) ot the 

real scene will be softened by the filtering ettect of the scanning aperture 

and It is this softened transient response which is sampled. 

« 

• Adjacent samples can never have an amplitude difference more than 
approximately one halt of the transient size. 

• The proper reconstructing function for a band-limited noiseless 

signal with extended objects is sin x/x; if these functions are erected at 
each sample point with corresponding amplitudes, the softened edge will be 

perfectly reproduced. (But see below.) Under these conditions, the same edge 
shape will be produced independent of the phasing of the actual samples. 

• The sin x/x reconstruction theoretically requires an infinite 

interpolation extent if the bandwidth is really finite, and vice versa. 
Thus, ± 00 of pixels is theoretically required in the reconstruction. Since it 
is incongruous to believe that pixels more than a certain distance (say, ±4 
pixels) have affected the sample at a given location, interpolation functions 
utilizing only a finite number of supports must be sought. Candidates for 
this are cubic splines, such as the TRW spline, B-splines (probably also 
cubic), splines under tension, and the Bendix restoration. The argument is 
that if there is no memory after a certain distance (the autocorrelation 
distance), pixels and, therefore, the interpolation ba.^is functions beyond 
chat range will have zero benefits, and tor practical systems, will only add 
extra noise (Appendix C) . 

• Substitution ot these various interpolation functions should have 
minor effect on the rise distance estimated here. 
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The transient situation across a boroer is sketchea in Fijj. 8a. ke are 
concerned here with the decrease in probability that a given pixel will have a 
value within the class limits as that pixel moves toward the boundary, as 
shown in Figure 8b. This etfect by itself will cause a bias in any classifier 
due to loss of boroer pixels. This loss is usually estimated by adding back 
into the area equivalent fractional pixels estimated from the amount of 
adjacent material mixing as determined from the amount of pixel brightness 
shift. Alternately, the loss may be avoided by proper choice of the class 
limits; the conditions for such an unbiased classifier are developed in 
Appendix D. 

The analysis, therefore, only needs to determine the area under the normal 
curve (assuming the noise is Gaussian) between the limits as determined by the 
classification class size and the offset from the "field interior value" 
caused by the mixture. Practical values of 0 (ratio of classification limit 
size to rms noise) will be bracketed by unacceptable classification accuracy 
on the low end (B”3) and unattainable signal to noise on the high end (B«7 
to lO) . 

First, consider that the class limits are accurately set so that the 
distribution of the field interior pixels is at the center of the class. Then 
allow the distribution center to shift (because of the mixture at a boundary) 
with respect to the class limits. The important scaling involved is the 
amount of signal shift caused by the transient total shift 1, as related to 
the desired class size S, for a given B • The set of curves of Figure 9a 
reflect the portion of the area under the noise component Gaussian 
distribution as that distribution is intercepted by Che class limits when the 
brightness center moves away from the class center as the pixel approaches Che 
field boundary. These curves are optimistic in that they assume that a real 
field average will always be centered within the class limits. This condition 
would only be attained with perfectly representative training samples, no 
atmospheric or oidirectional reflectance effects, all samples at the same 
pheno logic stage, etc. As these conditions normally cannot be expected 
simultaneously, the unknowns to be classified will likely be more uniformly 
distributed, so Chat the probability situation described for Che 
classification accuracy derivation is more realistic. When the limiting field 
center (zero offset) values are modified to agree with the curve of Figure 4, 
the basic curves of Che probability of correct identification as a function of 
the shift of intensity values result (Figure 9-b). 

The next question to answer is how rapidly (in distance) does the shift 
cake place? The transient rise distance estimated for Che Thematic Mapper 
(Appendix B) has very close to a Gaussian shape and a 1 lQ-90 " pixel. 
The amount of brightness shift is Che difference between the brightness of the 
field under consideration and the adjacent field which is causing the shiic. 
The important intensity relation is the magnitude of this shift, T, as related 
to the size S of the class being tested by Che ratio T/8. These curves, for 
various T/6, may be combined with those of Figure 9-b Co give Figure 10. From 
this may be estimated the loss in probability in classification of pixels near 
borders . 

The curves as given are optimistic in that they are based on the best-case 
transient rise distance. Any deteriorating effect, such as atmospheric 
scatter or turbulance, which injects light into the image of a given field 
from adjacent image areas, will raise the effective transient distance as well 
as cause a veiling haze (.the path brightness). A realistic estimate may 
consider tne transient distance to be perhaps 1.5 to 2 pixels; 1.5 will be 
used in the remainder of this report. 


blAS IN FIELD SIEE ESlIMiVTIUN 


IC can be appreciaced fcrora Figure 8 that several things are happening 
simultaneously: It the lower limit ot tield B and the upper limit ol field A 

have a gap between, pixels “lost" by field B will not be picked up by tield A, 
and will be considered unknowns ano not be counted in either field. The lost 
pixels will be some interior pixels, due to insufficient g , and a large 
number of near-border pixels, resulting in apparent tield size loss. Only if 
the lower limit of field B and the upper limit of field A are coincident will 
pixels lost from one tield be picked up by the other, and vice versa, to give 
complete account ot all pixels. For the field size estimator to be unbiased, 
the loss-and-pickup in both directions must cancel; that is, on the average 
the true border must be located. 

The various estimations of the transients across borders predict 
negligible phase shift; that is, the 50 % response point occurs at the true 
boraer. This implies directly chat the true border can be located from the 

data if Che decision brightness is midway between the brightnesses on either 
side of the border. However, even it phase shift lag causes border 

displacement, the 50% response point still represents the unbiased condition. 
It the lag is widely different in the various bands, relative (between band) 
border shifting will occur. For this reason, the transient response ot the 
various bands should be matched as closely as possible. It is likely that 

this 50% setting of the decision brightness will not be the same as would be 
calculated by a classifier; indeed, it cannot, as Che tield edge decision 

brightness will vary even for a given tield as the adjacent fields vary in 
composition. This leads directly to the required algorithm for field size 
escimaticn: First divide the scene into blobs, each of which is sufficiently 

uniform, and with closed boundaries. Then for each blob (field) determine the 
average brightness tor all Che interior pixels which are safely away from Che 
border. For each segment of the border, the correct tield edge decision level 
is midway (in ct's) between the average brightness of the two fields on either 
side. Then, if desired, the field interiors may be reclassified using the 
classification limits as determined from the training samples. 

The total effect will depend on the ratio of the number of border pixels 
to the number ot f lela-inCerior pixels, and hence is a function of the field 
shape and size. This is developed in some detail in Appendix D. The 
important conclusions which can be drawn are: 

• For accurate tield size estimation, the decision brightness must oe 
halfway between the brightnesses of the fields on either side of a 
given boundary. This means that classifiers set for material 
Identification will in general produce errors in field size. 

• Correct Nyquist sample spacing is about equal to 2/3 ot the 10%-90% 
transient distance. But relaxation to about equal this distance 
causes little problem. 

• It is important to keep the transient distance small, to get as many 
pixels into a given ground distance as possible. Field area errors 
become large at n^ = 5 or less. The transient distance must also 
be matched between spectral bands. 

• It is most critical to keep the decision limit midway between the 
aajacent-f ield brightnesses tor small fields. But this is 
increasingly hard to estimate for small fields because of the fewer 
interior pixels. 
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• At Che resolution expected tor Che Thematic topper, the atmospheric 
point spread function may become more dominant chan the Thematic 
Mapper point spread function. It this is determined to be true, the 
registracion requirements may be relaxed. 

• The total noise tigure (compared Co the class size in a given 

determination) controls S , and in turn controls the maximum 
attainable classification accuracy. However, tor practical range ot 

3 < H < 7, increasing P has only a moderate effect. 

• Because ot this, if small fields are most important, the reflected 
energy might more profitably be divided into smaller pixels, even at 
the expense of NEA^^ . As this will cause an increase in uata rate, 
optimum coding should be investigated, The possible noise introduced 
in reconstructing the data will cause some further decrease in the 
overall effective NEAp and so decreases p . But since there is 
smaller sensitivity to p than to l/ni, there should be a net gain 
in utility, 

• Oeometric rectification and registracion procedures must not only 
remove the internal distortions but must also produce pixels on a 
defined (preferably ground-referenced) grid. Current procedures do 
not do this. Without this reference grid, users will have to 
reinterpolate before multi-temporal data can be compared. 

EFFECTS OF MISREGISTRATION 


In preparation for estimation of Che misregistration effects, an analysis 
will first be made ot the expectations of registered data and the sensitivity 
to Che various parameters estimated. Pixels will be grouped into tour zones j 
1) Interior - those with centers 2 or more pixels inside borders; 2) Inner 
border - pixels with centers 1-1/2 pixels inside oorders; 3) Outer border - 
pixels with centers 1/2 pixel inside Pordcrs; 4) Exterior border - pixels 
outside the borders, with centers 1/2 pixel outside. This is illustrated in 
Figure 12. Estimates of classification accuracy for each zone will be 
obtained from Figure 10. The total estimate of calssitication accuracy will 
be the sum of pixels in each zone multiplied by Che corresponding zone 
accuracy estimate. Later, the field will be misregistered , changes in Che 
number of pixels in each zone calculated, and the probabilities again summed. 


As before, define 


n-^ =» number ot pixels in field short side 
n 2 ** number of pixels in field long side 
r * n 2 /n^, Che field shape ratio 


The numbers of pixels is: 



; Interior 

Cn--4)(n„-4) * (rn 
Not 

"ib 

; Inner Border 

2(nj^-2) + 2(o2-4) 

*^ob 

; Outer Border 

2(np + 2(n2-2) » 

"xb 

; .Exterior Border 
Total Area 

2(r+l)n. + 4 
2 ^ 


“ rn^ - 4Cr+l)n^ + 16 
* O^for HjA 4. 

-12 ( njS 4 ) 
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Note chat t may vary over a significant range, and that the later 
misregistration analysis applies to a pair of field images having precisely 
the same r value and precisely the same size, but displaced one from the other 
without rotation. 

The total probability of correct classification is given by 
P » ( p^n^ + + Pobiojj + Pxb^xb ^ 

The details are worked out in Appendix £. 

From the derivations in Appendix E, tne following global values may be 
selected for the parameters: 
r “2 
T/S » I to 5 
T * 1.5 pixels 
p "5 

The resultant probability of correct classification for the global set of 
parameters is given in ligure 11 (derived from the general curves of Figure 
10 ). 


After the parameters r, T/S, t , and P are selected, the resultant 
probabilities may be substituted for the brightnesses in the various zones to 
produce a ''probability image" aligned with the desired output pixel grid. The 
probability assigned to a pixel at a given location represents the probability 
that chat pixel will have a brightness falling within Che classification limit 
determined by the classifier, tor the given spectral band. The total 
resultant per pixel probability image tor the effect of q acquisitions is 
obtained by superposing the q probability images and multiplying together the 
set of q probabilities for each pixel point. An acquisition is defined to be 
each band or data set used simultaneously. Using these values, Che global 
estimate of the probability of correct classification tor one acquisition with 
no misregistration is given in Figure 13 for three values ot T/S. The 
predominant effect is the pixel mixture (the effect of T/b) . This is worst 
for small fields (n^ small) because of Che larger percentage of border 
pixels lor these fields. Note that tor T/S “ 1 (the same as 6 » 1/2 of 
Appendix D) , no probability loss occurs, even with small fields. 
Unfortunately, this desirable condition cannot be systematically obtained. 

MISREGISTRATION OF CONGRUENT FIELDS 

Figures E-5 and E-6 , and the accompanying discussion in Appendix E describe 
the misregistration situation. The essence of the result is that some area is 
lost from tlie external border, causing a further classification decrease. The 
basic character ot this misregistration loss term is 1/n^, so Chat it will 
have a slope approximately equal to -I on a log-log plot vs n^. The precise 
results depend critically on the values estimated for Che Pjj|j from Figure 
10. The values chosen leading to Figure 14 are given in Table £-4. 
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Note first that for large T/S (i.e., 2 or more) the eoge etfects are so 
great that the base probability is drastically affected (Figure 13), and that 
the external boroer pixels have zero probability of being within the class 
limits (Table E-4). For this reason, there is no misregistration effect for 
large T/S, and Figure 14 is plotted tor only T/S " 1. 

Comparison of Figure l4-'a,-b, and -c shows the effect of changes in r, 
with the square fields showing the most misregistration loss. As baiore, r*2 
is believed to be representative. 

Comparison of Figure l4-b and -e shows the effect of increase in P . The 
decrease in misregistration loss with higher B is evident. However, these 
losses in general are small to begin with, and the previous discussion calling 
for sacrifice o£ B to gain smaller IFOV (more pixels into a given field) 
would seem to override. 


Figure l4'-d indicates the etfect of transient rise distance, Various 
moaels ot the expected aistance yield Kt< 2, and this range is included in 
the figure. Increase in t decreases the basic accuracy of ecge pixels (Figure 
ll) and also increases the misregistration losses (Figure l4-d) . 


The curves of Figure 14 may be (even further) approximated by AP * K 
where K is given by; 1 



l.O 

1.5 

2 

3 

18 

23 

35 

5 

6 

8 

12 


and T/S * I, r « 2. 


COMPARISON TO OTHER STUDIES 

1. Proportion of Mixed pixels 

The mixed pixel problem has long been recognized. Nalepka and Hyde (1972) 
estimated the percentages of mixed pixels in agricultural scenes to be between 
20 to 40 percent, depending on the field sizes. Pitts and Badhwar (1980) 
obtain about the same percentage for field in the United States Great Plains. 
Chittineni (l 981), in estimating the proportion of components within each 
mixed pixel, also finds a large proportion of mixed pixels in a scene. 

2. Scene Segmentation and Blobbing 

In addition to the blobbing referenced in Appendix D, Bryant (1979) and 
Kettig and Lanagrebe (1976) have attempted to segment scenes into "pure" 
pixels and mixed pixels. 

3. Size and Shape Parameters Verification (n^, r) 

Figure 15 provides the scaling needed to put the prior analysis into the 
real world, relating Che field width n]_ to area for the assumed shape 
ratio r “ 2, for Thematic Mapper pixels. 

It has been shown that r “ 2 is a good model for a range ot ratios. That 
this also represents the real world is shown in the table below, using 
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data tiom PodwysocKi's report "An Estimate ofc t’iela bize Distributions tor 
betected Sites in the Mjor Orain Proaucing Countries", GSFC X-y23-7b-93. 


Location 

Shape Ratio 

5UA-of-tields 
wid th“Pixels 

50%-of-area 
wid th-Pixels 

Kansas 

1.71 

8 

9.5 

Iowa 

1.78 

9 

10.5 

Saskatchewan 

1.98 

ll 

15 

USSR 1 

1.39 

25 

31 

USSR 2 

1.53 

20 

30 

PRC 1 

1.48 

7 

9.5 

PRC 2 

1.37 

8 

9.5 

France 

1.44 

5 

6 

India 

Average except 

3.09 

2 

5 

India 

1,58 

a. 6 

15.1 

.. .» ^ ^ j 


Given the relative insensitivity of the results to the ratio r, the assumed 
r “ 2 would seem to be representative. 

Also from Podwysocki's report (Figures lb and 17) the area anu field width 
distributions are given in the table. It can be seen that (except for the 
many small narrow fields in India) the SOiS-ot-tne-tields width is about 12 
pixels and the 50%-ot-the-area width is about 15 pixels. However, to 

accurately estimate the total area, many more of the smaller fields must be 
classified. Taking the 10%-of-the-area point as the limit, the field width is 
about 8 pixels. Thus the field size range of primary interest is from about 5 
to 30 pixels. Again, India is low, with a lOX point at about 2 pixels. 

The Hughes"Thematic Mapper User Sensitivity Study Report" (July 1980) 
considers only one field size and shape, 14x14 pixels, in the simulation. 
Given the field size distributions from Ponwysocki's report, this 14x14 size 
does not seem to include the more critical small sizes. For this reason, the 
combined bulk and boundary classification losses quoted are optimistically low. 

4. Adjacent Field Transient Urightness Shift Verification (T,S) 

There is no data for the amount of transient edge shift to be expected. 
An intuitive feel is that during the growing season all fields will be 
vegetated, either with the crop of interest or an alternate crop. To the 
extent that "green is green", and except during the period when different 
crops have widely different phenologic stages, the signatures will be 
similar. Therefore, the decision limits must be close to the field interior 
averages, and the fi will be low. Also, in this study no account has been 
taken of field edge artifacts: hedgerows, roads, ditches and the like. For 
small fields, these may introduce appreciable further error, as they will form 
an appreciable part of the mixed pixels. 

The Hughes report lists the set of 8 classes used in the analysis. 
However, no data is given tor the class brightnesses nor the variance used in 
Che simulation. Thus, there is no way to estimate the fi which was 
effective during the analysis, nor the adjacent field Cransient brightness 
change T , nor Che effective class limits b . however, the statement is 
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maae "...the classes used for ctus sinmlacion are inherently more separable 
than cnose used by GEi" This woula imply Che use of a high B » This is 
verified by Che unusually large within-tield accuracy obtained - in Che >95^ 
range, requiring a 6 of lU or more. 

The Hughes report correctly states "Depending on the threshold values used 
... results could be better or worse than those obtained by performing 
multiple class classifications. Finally, Che reauer is cautioned to use Che 
classification accuracy only in a relative sense to compare effects of the TM 
parameters. The absolute classification accuracy losses will be dependent 
upon the particular scene and the statistical separation of the classes being 
investigated." Although this is true, simulations should approximate reality 
to be useful. The GE study, which concentrates on agricultural subscen ., is 
more representive of the tasks (e.g. LACIE, AGRISTARS) for which 
classification accuracy is most critical, and the larger effects of the TM 
parameters and misregistration are probably more realistic. 


5. Adjacent Field Transient Distance It) 


The necessity of maintaining a low transient distance i is illustrated by 
Figures U ano Iq-d. This distance is the result of the convolution of Che 
optical blur, diffraction, detector size, and filter. The Hughes report shows 
tnat "...convolution with the relatively large square TM detector and the 
electronics modulation transfer function U-iTF; virtually eliminates any 
differences created by various shapes of blur... The largest degradation in 
image quality is caused by slow rise times or mixtures of filters." All of 
the filters simulated show a r 10-90 about 1.3+ pixels, except the 
undershoot filter, which had a T iO(»-yo of 2. A pixels (Hughes Figure 5) * The 
insensitivity to the exact filter shape lexcept the undershoot) justifies the 
use of a Gaussian rise as surrogate. The assumed global value assumed here 
^10-90 " seems realistic, although perhaps a bit optimistic. The 

potential further blurring by the atmosphere, which will affect the total 
system performance, is neglected as it is not a hardware ps’ameter. 


The large transient distance, large delay (about .8 pixels more than the 
protoflight or ideal filters), and slow settling time of the undershoot filter 
reinforce the Hughes statement as to the need to match filters, and, in 
particular, to avoid the undershoot case. 


It is shown in this study that edge pixels attected by adjacent fields are 
Chose within 2 pixels of the boundaries. The Hughes study considers boundary 
pixels to be tnose within 3 pixels of a boundary. Although the difference 
seems small, this is an appreciable part of a small field, and includes one 
row of pixels which is not affected except at very large T . Therefore, the 
loss in accuracy of boundary pixels is underestimated in the Hughes report. 


6. Type Of Misregistration 


The misregistration errors can be classed as displacement or distortion, 
depending on whether the area of interest is mislocated or distorted. 
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borne typical causes are: 

Displacement - tne area of interest is mislocated 

Scan velocity cnanges 

Alignment Errors 

Warm to Cold Focal Plane 

Platform Vibration 

General mapping projection on round earth (Nultiteraporal problem) 
Distortion - the area of interest is distorted 

Scan Line Corrector 

Nearest Neighbor Interpolation 

Altitude Relief Displacement (Multitemporal Problem) 

Total misregistration effects, whether caused by displacement or 

distortion) will be considered (without mathematical justification) to be the 
rms of Che various individual causes, and treated as a displacement, (a brief 
discussion of this is given in Appendix E) . 

Band to band displacement as caused by scan velocity changes from nominal 
may be estimated from Che focal plane layout. Figure 18. Considering Band 
4 as a reference, the distance in pixels to Che other high resolution bands is: 

Band 57321 

Distance 7l 45 25 50 75 

The rms of these distances is 125 pixels. Considering that Che scan 
velocity may be 0.5^ away from nominal, the resultant rms displacement of the 
bands is l/2%xl25 = 0.52 pixels. If the scan velocity error cannot be 

maintained to considerably below this value, band to band registration will be 
required. If the velocity can be measured or modeled (even if not 
controlled^, this registration may be done by calculation; if not, band to 
band correlation will be requirea over the face of each scene. The 

criticalness of at least measuring the velocity is evident. 

The various alignment errors presumably can be calibrated once and applied 
to the total image (although perhaps on a band to band basis). 

Platform vibration will add an additional apparent scan velocity component 
when the IFOV is projected to Che ground. Considering that Che IFOV scans at 
a rate of about 0.13 rad/30 msec = 4.5 rad/sec on the ground, holding the 
spacecraft vibrations to (say) 0.1% effective change in scan velocity to 
minimize the band to band displacement requires holding Che vibration velocity 
peaxs to about 5 mraa/sec. Again, the utility of measuring or modeling this 
velocity is evident. 

Displacement errors of a different type affect the ability to easily do 
multitemporal overlays. These are the large area low spatial frequency 
warping distortions, caused by the projection to the earth (altitude, 
attitude), scan velocity integrated effects, and spacecraft vibration. These 
can be moaeled and a first order blind correction made, it sufficiently 
precise measurements are made. The GE report "Geometric Correction ^latrices 
for Thematic Mapper", dated 4/22/80 covers this in detail. Of interest to the 
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eventual data user is the grid to which the data are transformed! "Output 
scene coordinates are an XY system with origin at the scene center. Direction 
of Landsat-D motion is in the direction ot the -Y axis." Because of this, 
each scene will be on a different grid, and even nominal repeat scenes will 
not have common grids. Assuming that all of the warping has been correctly 
removed, tor many uses users will still have to resaniple to common grids, 
dealing with translation, rotation, scaling, and recasting to the desired 
working map projection. 

Relatively few, if any, users will be in a position to pertorm all of the 
corrections outlined in the GE report. Thus, if the images are properly 
prepared and archived with complete enough annotation that data from a given 
ground location can be expeditiously extracted, there should be relatively 
little tratfic for the raw data. Because of the independent grids, o£ top 
importance is that the system-induced internal warping be removed from images 
supplied to users to minimize the further detailed correlations which the 
users must perform for registration and to allow the production of undistorted 
film images. 

Instrument-associated distortion removal is discussed in the GE report, 
but no feel for the adequacy is available. The deleterious effects of nearest 
neighbor interpolation are discussed in Appendix C of this report. A 
remaining prime distortion effect is that caused by relief displacement as the 
altitude of the ground at various places in the scene changes. Although not 
an instrument problem, it will be a system problem. The construction is given 
in Appendix F and the net result in Figure 19. The essence of the argument is 
that, without relief displacement correction, map accuracy at 1:24000 
(approximately 0.4 pixel) data location and image registration to fractional 
pixel cannot be met in areas ot high relief. Thus, tor precision work, many 
users must perform precision correlation and interpolation over the entire 
face of an image, and/or use pixel-scale altitude data in an open ended 
correction. 


DATA UTILITY 

This report is an attempt to model the potential misregistration effects 
on multispec tral classification accuracy. It may allow the comparison of the 
various tests and simulations, and points out the variables which must be 
reported for those simulations to allow their validation. Unfortunately, most 
simulations are not accompanied with sufficient data to provide understanding 
of the precise conditions under which the (e.g.) classifier operated. Because 
of this. It is difficult to judge the degree to which the simulation is a good 
surrogate for the real world, and difficult to compare the simulations. 

For many purposes, multispectral classification is not the required oata 
analysis procedure. Although many of the concepts herein may be applied to 
other analysis procedures » other factors not considerea may overshadow 
conclusions reached by this modeling. For example, precision location of 
boundaries in the image for mapping purposes will invoke a different set of 
criteria. Image feature interpretation, especially of features in which Che 
brightness does not have abrupt boundaries, will also be different. 
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filially, the analysis does not answer the bottom line question even il: 
multispectra I classifcation is the required procedure: Given a certain loss 

in accuracy, how uoes chat aumai,e Che ability to use the data analysis 
results? Or: Given that Che system produces data Co a certain registration 
accuracy, how much auoitional work, by how many users, will be required to 
bring Che data to sutticiently good registration tor leach ot) their needs? 
These evaluations will be discipline dependent, ana must be sought separately. 
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TRAINING SAMPLES 

— TEST SAMPLES 

— WHEAT (TEST) 

SOYBEANS (TEST) 


Figuro 1, 


Effect of random noise on the accuracy of multi-spectral 
classification. (From Ready 1971) 
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Figure 3. Effect of noise on tlie probability of correct multi- 
spec tr al c la s s if ica t ion . 



Figure 4 Given a signal uniformly probable over the dynamic range. 

Gaussian noise of valuer o. The cutvce show probability 
of correctly recognizing a class corresponding to the noise 
free signal as a function of the ratio 6 «• class sire / a . 
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Figure 2. Location of typical materials in 2-dimensional decision space, 
with decision boundaries as determined by a) (Upper Left) 
Maximum-likelihood; b) (Upper Right) Parallel piped Box; 
c) (Lower Left) Defined Clusters, as in ELLTAB or Image 100. 
For a given brightness B 2 in spectral band class 

extent for "Grass" in band X^^ is shown for each classifier in 
Figure 2d (Lower Right) , together with the location in band X^ 
of the distribution of pixels of a group located at position 
"x". 
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ORIGINAL- PIXELS ON 
FIELD INTERIOR BORDERS 


Figure 6a (Left) Misregistration of Hand 

causes the brightness in this band to 
vary. For one band misregistration 

(in this case, Xj) the intensity 

variation will be aligned with the 

spectral axis in decision space. 


Figure 6b (Right) The amount 
and direction of spectral shift 
depends on the mixture, deter- 
mined by the adjacent material. 
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Figure 7 Construction for Relating Spatial Frequency to Transient 
Distance 
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h r?ro»« 6*ctlon of brlghtnesa trace acroaa a boundary batveen 
two flelda, ahowlng tha dlatanca required for the bright- 
neae tranaltion. 



"S zs s s 


Figure 8b distribution of "field" plaels moves down the transition 
curve as the measurement point moves toward the boundary. 

The shaded area is the proportion which in.ll be correctly 
classified. 
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Figure 9a Loss in Probability of Correot Classif Icallon as the 
Brightness of a ('.roup of Pixels changes - curves for 
Precise Centering of the Initial (5roup within the 
Glass Limits 



Figure 9b Loss In Probability of Correct Classification as the 
Brightness of a Group of Pixels Changes - corrected 
Curves 
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PIXEL CENTER DISTANCE FROM BORDER - PIXELS 


Fipure 10 Combined Curves for Translation of Pixel Distance from 
Border to Shift in the Brightness (Scaled in Units of Class 
Size, S ) to Probability of Correct Classification. 
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DISTANCE OF PIXEL CENTER FROM BORDER - p!x«li 

Figure n Probability of Correct Clasalflcatlou of Individual Pixels 
as a Function of the Distance of the Pixel Center from the 
Field Border 



Figure 12 Construction for Estimating Total Effect of Border Pixels, 
with Perfect Registration 
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Figure 13 Probability of Correct Classification using Global Parameters, 
for Perfectly Registered Pixels. One Spectral Band Only. 
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Figure 14 Loss of Classification 
Accuracy due to Misregistration of 
One Band, for Various Parameter Com- 
binations. 

g » Class size/ a of noise 
r * Field Shape Ratio, long/short sides 
X “ 10-90% transient distance 
nj *• length of short side, pixels 
d ■ displacement, pixels 
AP » loss in probability 
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Flpure 15 Field Area Scallnp for 30 x 30 Meter Pixels and Shape 
Ratio r ■ 2. 

FIELD WIDTH - THEMATIC MAPPER PIXELS 
1 10 too 



Flpiire IS Cumulative Frequency Distribution (In %) of Field Width 

vs. Total Number of Fields. From Podwysockl, GSFC X-923-76-93 
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FIELD WIDTH - THEMATIC MAPPRR PIXELS 
1 5 10 50 100 



FIELD AREA (HECTARES) 


Figure 17 Cumulative Frequency Distribution (in %) of Field Area vs. 

Tot?l Cumulative Area. From Podwysockl, GSFC X-923-76-93 
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Figure 18 Detector Prolectlon at Prime Focal Plane (From, Hughes, 1980) 
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Figure 19 


NADIR SEPARATION - Km 



Relief Displacement Shift as a Function of Nadir Separation 
for the Thematic Mapper on Landsat D, 
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APPENDIX A - INTENSITY RESOLUTION 


In Che absence of all noise (i*e., no sensor noise and absolutely uniform 
scenes), liner intensity quantization would allow more subtle 
identifications. One estimate of usable quantization may be obtained by 
defining that "digitizing to m bits" means adjusting the quantization step 
size so that the scene plus sensor noise present causes the (m-t-l)th bit to be 
correct only 50J4 of the time (a subject tor debate in itself). This is 
precisely the same problem discussed previously for the accuracy of 
•multispectral classification ana illustrated in Fig. 4 (Billingsley, 1975). 
Referring to that figure, we can see that 50^. probability occurs with a g - 
1,5 (for the (m+l)th bit) giving a g - 3 for the mth bit. For sensor nns 
noise of 0.5% (Che TM specification) and no scene noise, Che number of useful 
levels IS 1/(3x0.005) ^ 66 . On this basis, quantizing to 6 bits (64 levels) 
IS justified, and the probability of zero error is about 0.7. This analysis 
may oe modified it it is known that the noise varies with the brightness of 
the scene, but Che approach is still valid. 


Another estimate of the number of useful bits may be obtained by 
estimating the loss of accuracy of the classification caused by quantization 
error. Define the perfect sensor as having no random noise nor quantization 
error (i.e., an infinite number of bits). This will define 




o 


class size • n 

CT 

scene 


and 




For the real sensor, 3 <6 because of the finite and 

O’ quantization’ probability of correct classification P is related 

to Pg by; 

p B p 

o 

The loss in classification accuracy AP = Pg ~ P thus depends on Pg and the 
ratio 



where 

is the scene noise 
o ’2 is the sensor random noise 
0 ^ 6 ne quantization noise. 

The entire escimation boils down to estimating the Pg required, which 
sets 3 g, and then partitioning the noise components to satisfy the desired 
A P. Note that unique explicit partitioning cannot be defined because neither 
the scene noise (pixel-pixel variation) nor the class-to-class separation, 
which influences the class size decision, can be controlled. However, to be 
useful, a classifier should have an accuracy in the 70-90% range, requiring 
2.5 < 3 <7. 


A plot of the loss in classification accuracy vs Pg is given in Fig. A- 1 , 
for the parameter families 3i/3 and 02/01 . Noise allocation starts with 
the definition of the desired Pg and ascertaining that the required 
3 g can oe obtained. Definition of the allowed AP determines (e.g., from 
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the graph) the allowed j,ens or/ ^ scene* estimation ofc the scene norse 

tor which the other conditions apply allows the calculation ot the total 
sensor noise allowed. The tinai step is to partition this noise between 

sensor random noise and quantization noise. 

for example, let Pq = Then Pq ® -0.40/log(0.85) ■ 5.b7. If we 

allow a 2/. loss in accuracy due to the sensor, CT 2 /ai « O.b from Fig. A-l or 
its related equation. It the expected scene tor these accuracies has a noise 
(non-uniformity) » 2%, the sensor can have a noise figure cr j * 0.6 x 

2% " 1.27.. It the NEAp ( ranoom) of the sensor is expected to be 1%, the 

allowable O'quantization “ ^ 1 .2%^-l/.‘^^ ~ 0.66%. This can be met with 

6-bit quantizir.g, for which O'quantization “ 1/64/12^ “ 0.45%. (Fig. A-2) . 

For the assumed conditions (Pq * 5.67 and * 2%), the required 

class size must be at least 11.5% (5.67 x 2%) of tull scale for individual 
pixel classification. If this causes confusion between classes due to overlap 
ot class boundaries, several pixels must be averaged (i.e., n> l) to reduce 
O' scene effective* secondary criteria such as the per field classifier 

must be used. 

Two final observations', (1) Increasing the number of bits of quantization 
produces improvements which asymptotically approach zero, as each successive 

bit reduces the step size by a factor of 1/2. (2) A scene having as little as 

2% variation is a very uniform scene. Since this noise is rms'd with the 

sensor noise, it will overwhelm any but a very noisy sensor. Therefore, tor 
purposes ot multispectral classification, an extreme number of bits would seem 
to be unnecessary. 

Essentially this same conclusion was reached by Tucker (1979) through a 
very dittarent analysis route. he concluded Chat a 2% improvement could be 

obtained with 7 vs 6 bits, and a further 1% by 8 bits. 

ketere nces: 

Billingsley, "Noise Considerations in Digital Image Processing Hardware," 
Chaper in Picture Processing and Digital Filtering , T.S. Huang, ed.. 
Spring er-Verlag , Berlin 1975 and 1979. 

Tucker, C. J., Radiometric Resolution for Monitoring Vegetation: How Many Bits 
are Needed?, NASA Technical Memorandum 80293, GSFC, May 1979. 
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APPENDIX B 


TRANSIENT RESPONSE DISTANCE 


The current specitications for the Thematic Happer step response are: 


scanning a step function (edge) of input radiance of any magnitude within 
the usable range of the TM shall produce a response satisfying the 
following criteria: 

a.) The overshoot/undershoot shall be limited to a maximum of 10!S of 
the step size. 

D.) The step response for bands I through 5 and 7 shall be settled 
to within 1.5 percent of the final value within 30 microseconds, and to 
within I percent within 60 microseconds after start of response. Band 6 
step response shall be settled to within 1.5 percent of the final value 
within 120 microseconds, and within I percent within 240 microseconds 
after start of response. Start of response is defined as the time when 
the response level change exceeds 2 percent of the step size. 

c. ) The rise time for bands 1 through 5 and 7 shall be less than 20 
microseconds. The rise time for band 6 shall be less than 80 microseconds. 
Rise time is defined as the time interval for the response to go from 2 
percent of step size to the first point in time that the response is 2 
percent of step size away from the final steady state value. 

d. ) The variation of response time between channels of each band 

shall not exceed +- .5 microseconds and shall be characterized as 

"systematic" or "random." Response time is defined as the time difference 
between the 50% points of stimulus leading/ trailing edge and the video 
channel output leading/trailing edge. 


The critical portion of this specification for the present discussion is 
paragraph c) . As the sample spacing for the TM is about 9.8 psec, an 

equivalent statement for rise distance woula be "...2% to 98% in 2.04 
pixels." The optical blur, lens diffraction, detector size, electronic filter 
and atmosphere blur, convolved together, will provide the total transfer 
function for the system. Neglecting the atmosphere, the effects of the 
remainder may oe estimated as follows: 

The elements affecting the response in the along-track and cross-track 
directions are; 

LENS/OPTICS APERTURE Along Track 


LENS/OPTICS APERTURE SIGNAL FILTER Cross Track 


A rectangular aperture (FB-1) 30 x 30 meter as projected onto the ground 
is swept across track at uniform velocity. For development of the transient 
distance the sampling is immaterial, although sampling will be accomplished at 
one sample per aperture distance (IFOV). Real detectors may have an intensity 
response which is quite uneven across the face of the detector. This will 
cause the transient rise and the accompanying MTF to deviate from the ideal 
case. These effects will be ignored, and the aperture modeled as having 
uniform response. (Fig. B-4) . 
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Tn« opcies are assumed co have a Gaussian point spread tunction with a 
standard ueviation ot about 6 meters. ^Hughes has found that the precise 
shape ot this function is unimportant.) (Figure B-2) 

The total along track point spread tunction is the convolution of the 
aperture and optics functions, as shown in Figure B-3. The response ot this 
combination to a step tunction ot ground brightness is the convolution of this 
point spread tunction with the step. This will be a ramp of duration about 
equal to the time tor the detector to sweep one IFOV, blurred by the optics. 

In the across track (along sweep) direction, the electronic filter 
response further affects the rise time. In the Hughes simulation, the blurred 
ramp was input to the filter and the response measured. The results showed 
that, except tor the case of an undercompensated filter, the output rise time 
was about 13 sec, or 1.33 pixels (lO/i-90X, from the graphs). The 

accompanying 2%-y8% rise time was about 20 sec, the specification limit. 

The related filter delay (in the Hughes simulation) was about 16 - 20 
U sec, depending on the specific filter. Variation in this delay between 
filters will cause a variation in the apparent position of field edges, both 
between detectors within one band and between bands. To minimize registration 
trauma, these should be matched. At the very least, the measured delays must 
be known to allow compensation. 
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Figure B-2 ERIM TN Simulation - 
Optics Gaussian Blur 




Figure B-3 ERIM TM Simulation - 
Along Track Spatial Weighting 
Function 


Figure B-4 ERIM TM Simulation - 
Cross Track Transient Response 


lU 

1/1 

z 

o 

8s 

Ul 


Ul 

> 

5 


a: 


1.0 

0.8 

0,6 

0.4 

0.2 

0 



B-3 


•30 -15 0 15 30 45 

DISTANCE - meters 







APPENDIX 0 


INTERPOLATION OR RESAMPLING 


The problem of resampling arises whenever pixels are required in an output 
grid at locations different from where input pixels are located. Many methods 
of new pixel value generation are available; however, computation cost 
considerations tend to inlluence heavily the choice of methoa. The simplest 
method is called "nearest-neighbor resampling" and it assigns a value (dn) to 
a new output pixel location (i, j) according to the value dn^ £ of the 
spatially nearest pixel (k, £ ) in the input array to the precisefy desired 
input location (x, y) . The closest pixel (k, £ ) is found by: 

k *= integer part of (x 0.5) 

£ “ integer part of (y + 0.5) 

so the dOj^^ j “ . 

This method preserves the exact value of pixels in the input data set, and 
thus introduces no new spectral classes; but it introduces spatial shift 
errors such that the local geometry may be inaccurate by up to \/? of the 
instantaneous field of view (IFOV) > or the size of a pixel on the ground. 
Worse yet, the pixel from which the gray level is derived shifts suddenly in 
location from the pixel just before the correct resampling location (x, y) to 
the pixel just alter it. This problem becomes critical during digital 
multitemporal picture comparisons because, while the registration of detail in 
the two images may be perfect in one location, elsewheie there is 
misregistration. Since the contribution of a given pixel is constant whenever 
Che output sample is to be drawn from d: 1/2 pixel spacing, and zero outsiae of 
that range, the average frequency response is sine x, with a first zero at one 
sample/cycle lFig.C~l). It thus imperfectly filters the sampling sidebands, 
and has appreciable attenuation of the high baseband frequencies. The visible 
effect is the "blocky" appearance of images interpolated by this method. 
Nevertheless, tor many purposes this interpolation is adequate, and may be 
accomplished with insignificant cost since only address-rounding is needed tor 
each new pixel assignment. 

A smoother approximation to the assumption of continuity is obtained when 
the adjacent pixel is allowed to influence the estimation of between-pixel 
values. When only tlie adjacent neighboring pixels are used, only a 

first-order (bilinear) interpolation is possible. In bilinear interpolation, 
un£^j is found by using an interpolation scheme with the four nearest pixels 
surrounding the resampling location (x,y^ to aetermine the dn at (x,y) (see 
Fig.C-2). If (x,y) lies between samples k and k + 1 and lines £ and £ + 1, 
then the gray level at (x,y) can be found by using: 

^«xy “ [(x-K) (dn£^k+i"'^"Ak) " (‘InA+l ,k+l-'^*^£+l , k-*] 

which reduces to 

dn^y = (x-k) (y-^)[dn£^g+X + dn^+i^g - dnj,j^ - dn£+i^k+x] G-2 

Since ttie contribution of a given pixel falls off linearly with distance tc a 
distance ±. one pixel spacing, (+a), the average frequency response (H) in one 
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dimension can be found by taking Che fourier transform of the triangular 
convolution kernel Ih); 


h 


bilinear 




H 


bilinear 


ai 



-2af X 
he ^ dx 


where fjj is Che frequency in the x direction and a is the sampling 
interval. Alternatively, hbixinear recognized as the convolution of 

two rectangular functions. In either case, the resulting frequency responfc 
is found Co be stnc^x, with the first zero at one sample/cycle. Resultant 
images are much smoother Chan chose from nearest-neighbor interpolation, have 
about 1/4 the mean-squared resampling error (Shlien, 1979) of nearest 
neighbor, and requites appreciably more computer time, primarily because of 
the tour multiplications involved. 

Accuracy can be improved further by increasing the oumbet of pixels in the 
vicinity of the resampling location from the nearest 4 to the nearest 16 14 x 
4 matrix) or more. Tlie additional points ofier an opportunity not available 
using the simpler methods to shape the pass band by adjusting the relative 
contributions (.via weighting factors; of the various pixels. Cubic methods 
have emerged as the most significant higher order resampling method and 
sevetal variations exi?t. Classical cubic polynomial Lagrangian interpolation 
is the most common and proauces a smooth resampled image with good frequency 
response. Sidelobes can cause overshoot; however, the cubic case is a good 
compromise oetween artifact introduction and computation cost. A spline 
function developed b> Riffman (1973/ has proven to be quite satisf aetoi’y in 
producing a reasonably shaped passband whicj? provides some high-trequency 
enhancoment : 


fl(x) 

* 1 - 2x^ + jxj 

0*^1 xj < 1 

f2(x) 

» 4 - Sjxl + 5x2 _ [j^|3 


f3(x) 

0 

!xj 
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'this function and its passband are sketched in Fig. C-3 . It uses ± 2 

neighbors for interpolation, has no contribution past 2 pixel spacings, and 

has continuous first derivatives. It has a mean-sciuared resampling error 

about 1/3 that of bilinear (Shlien, 1979), but requires four multiplies for 
each dimension. 

Modifications of the cubic case have been uesigned to minimize undesirable 
characteristics t(6imon (1975), xiffman (19/5), Tabor (1973)] while 

maintaining the four-multiply-each dimension per new pixel cost. Two examples 
will suffice. 

1. By reducing the slope at the first zero crossing of the function of 

bq. C^3 to one halt, a new function (Eq. C-4) is produced having 

essentially the same transient rise distance but with less overshoot; 


C-2 



fl(x) - I - (l/2)t5x2 - 3lxp ] 0^|xhl 

f 2 (x) - (l/2)[4 - 8|x| + 5x2 - |x|3] l<jx|-:2 

t^{x) » 0 2'-(xl 
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I 


2. The functions of Eq. C-3 or C-A are based on the requirement that they 
go through zero at a distance of 1 pixel from center, thus producing 
the negative lobe and the high-trequency enhancement. It, instead, 
Me best approximation to a function represented by the samples is 
desired, and it is recognized that measurements made more than some 
distance away from a given pixel will have no influence on it, an 
interpolating function having limited support (,i.c., local basis) and 
with the smoothest interpolation ot all functions passing through the 
same set of points is desired, tor equal spacing ot the measurements, 
such a function is the cubic B-spline (Hou and Andrews, 1978), having 
continuity in the function and its first two derivatives at the knots 
fthe sample points!, zero slope at the center and at the second knot, 
zero amplitude past the second knot, ano a summation of contributing 
overlapping splines equal unity. Invoking these conditions, the cubic 
b-spline is found to be (symmetrical around x*0): 

f|(x! “ (1/6! [3|xp - 6x2 + 

f£(x! ® (1/6) l“|xP + 6 x2 - I 

f3(x) “ 0 

A plot of this function is 
(Peyrovian, 1976) that for sampling near the Nyquist rate, the cubic 
B-spline is the optimum interpolator. Other interesting properties of 
spline interpolation are given in LaFata and Rosen (1970), Curry and 
Schoenberg (1966), Hou (1S76) and references therein. 


.2 X +8] 


0 ijxp 1 

1 ilxp 2 

2 < 1x1 
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given in Fig.C-4. It can be shown 


Finally, by using more than ±. 2 samples to determine the interpolation 
function, higher order functions may be produced having less mean-squared 
resampling errors than any of the above (Shlien, 1979). However, recognizing 
that Landsat, for example, has a finite amount of sensor noise, the utility of 
reproducing this noise with greater fidelity is suspect, and the higher order 
interpolators have not found widespread use. 


SOME GENEKiiL COMMENTS ON INTERPOLATION: 

1. Cubic convolution in the orginal sense generates overshoot on abrupt 
borders with high brightness differences. This is the reason that the 
modified coefff icients were derived. They have been largely ignored to date. 

2. Cubic convolution causes the apparent spread of small features. But 
small features can only be accurately located (within pixel) through the 
estimated continuum. Very small features such as Evans' (1974! mirrors do not 
satisfy the basic Nyquist criterion and should not be used as a surrogate tor 
the world. To the extent that they are important, they must be evaluated 
independently of classification accuracy and precision spatial location. 



3. Nearest neighbor registration will cause discontinuities in the 
location oi edges. This is particularly disconcerting tor features which lie 
at a small angle with respect to the scanning raster. Small rotations will be 
continually encountered in registering tne nominal overlay images ot a given 
wkS. Interpolation minimizes these discontinuities. 

4. Jayroe (1976) illustrated the severe moire effect caused in the visual 
appearance of images registerea by nearest neighbor. This visual effect is 
minimized with interpolation. 


LFFEGTfa t/N CLAbSIFlCAXION rtCCUBACY 


The discussion so far has concentrated on the ability of the interpolator 
to generate the continuum of which the original pixels were samples, after 
which the continuum was resampled to estimate the dn value of a sample which 
might have been acquired at an inter-original-pixel location. This process 
(except for the nearest neighbor process) generates intermediate dn values, 
and in general produces a smoother image. In the context of misregistration, 
the tradeoff is one of locating an original pixel at the wrong location 
(nearest neighbor) vs locating a new pixel with its attendant new dn value at 
the correct location (interpolator). 

The original pixels are not independent, as shown by the transient 
distance being in the amount of about 1.5 pixels. The additional 
correlation produced by the filtering of the interpolator will be minimal, as 
the basic Nyquist sampling spacing is approximately satisfied. However, if 
contiguous training samples are assumed to be independent, the correlation 
will cause the training class spread to be underestimated. This in turn may 
cause the class decision limits to be changed, with a resulting change in 
At the same time, however, the txltering will reduce the field-interior noise, 
so that for reasonably uniform areas the choice ot interpolation algorithm 
will be second order. 

In Appendix D the conditions for an unbiased field area estimator are 
derived to be tnat the decision level should be midway between the 
biightnesses of the two tields on either side of a boundary. In this case, 
the new dn levels produced at borders using an interpolator will be picxed up 
anyway, and their presence would seem to be inconsequential. If the decision 
Levels are set for classification of the field materials, in general they will 
not be at the midway point. In this case, the new dn levels may well fall 
between the levels as deiined for the adjoining field materials, and may 
therefore not be recognized. However, in this case, many of the true pixels, 
having border brightness changes due to the sensor spread function anyway, 
will also be lost. Nearest neighbor warping would seem to have a slight 
advantage in not producing the additional filtering. Unce classified, only 
nearest neighbor warping can be used unless careful definition of proportion 
estimates are made for fractional pixels. The nearest neighbor pixels will 
have positional errors relative to other spacecraft passes due to the 
inability to locate them precisely onto the reference grid. These positional 
errors require that an estimation of the boundary position be made subsequent 
to classification; for multitemporal overlay the fractional pixel errors will 
counteract the slight advantage of no filtering. 
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Figure C-2 


Nearest Neighbor and Bilinear Interpolation Resampling Techniques. 


From (Joetr et al, 1975 



Figure C-3 TRW Cubic Convolution 


Figure C-A Cubic B-spllne 
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APPENDIX D - LOSS OF BORDER PIXELS 


The progressive shite in the average brightness of a group of pixels as 
the group approaches a boraer (Figure S-b) may be redrawn as Figure D-l. The 
class limit positions here- are consieered to shift with relation to the 
(noisy) group average brightness. The noise distribution, which in reality 
aeviates toward bimoaal (with mixtures more or less 50-50) and then back to 
Gaussian (rrom the adjacent field as it contributes the majority of the area), 
will be lett as Gaussian tor this analysis. Taking the <y of the group average 
as reference, a change in o* is reflected as an apparent change in class size. 
The right part of Figure D-l inaicates the locations of the class extent with 
various amounts of brightness offset (shift due to the border effect), scaled 
as tractions of the class size b. The upper and lower limits of the class, 
after offset, are the limits between which the t^'actional part of the Gaussian 


noise area is 

calculated. 

The result of 

the area 

calculation 

is given 

Table D-l: 








Offset 

Corrected Area 





xS 

0»3 

5 

7 




Zero 

.866 

.988 

l.OOO. 




1/8 

.839 

.969 

.996 




1/4 

.761 

.894 

.960 




3/8 

.642 

.734 

.809 




2/4 

.500 

.500 

.500 




5/8 

.354 

.266 

.191 




3/4 

227 

.106 

.040 




7/8 

.130 

.030 

.004 




4/4 

.067 

.006 





9/8 

.031 






5/4 

.012 





Table D-l. 

Fractions of 

the total area 

under the 

noise curve 

vs amounts 


of brightness shift scaled to class size S, for various g , Initial 
average brightness centered in the class. 


These are the probabilities piotteo in Figure 9-a. The shifts are from a 
starting position exactly centered within the class limits. As aiscussed in 
the body ot the paper, this is optimistic, in that any group ot field-interior 
pixels IS not likely to be centered. But tne field-interior pixels (zero 
offset) are the ones tor which the probability of correct classification was 
calculated, leading to Figure 4. Therefore, the values of Table D-l may be 
corrected by scaling to the tielo-int erior values ot Figure 4; this gives 
Table D-2 which has been plotted in Figure 9-b. 


D-l 



Of fset 

Gorrectea Areas 

Units 


3*5 

3*7 

of S 




0 

.736 

.832 

.677 

l/B 

.713 

.816 

.873 

1/4 

. 646 

.753 

.842 

i/8 

.545 

.618 

.710 

1/2 

.423 

.421 

.439 

5/8 

.300 

.224 

.163 

3/4 

.192 

.089 

.035 

7/8 

.110 

.026 

.004 

1 

.057 

.005 


1-1/8 

.026 



1-1/4 

.010 




Table D-2. Fractions of the total area under the noise curve, based on groups 
of the interior pixels having a uniform probability of brightness. 


Figure D-2 presents an expected transient response curve for the Thematic 
Mapper, as estimated by Morgenstern et al (ERIM Report 11920U-10-F, April 
1976). This curve is very close to that of a Gaussian impulse response 
function, and has a ^ 1.0-90 “ ^ pixel. Along-track and cross-track expected 
responses are almost the same, and will be considered the same for this 
analysis. Unity relative response in Figure D-2 is the transient total 
brightness shift T . What will be important is the shift as scaled to the 
desired class size b. Table D-3 relates the fractional response ys distance 
from the true border (50* point) tor various T/S. 


Distance 


From Boroer 

Fractional 

Shift From 

Shifts 

in Units of S 

Pixels 

Response 

Full Response 

T/S-1 

T/S“2 

T/S*5 

0 

.5 

.5 

.5 

1 

2.5 

.1 

. 6 

.4 

.4 

.8 

2.0 

.2 

.7 

.3 

.3 

.6 

1.5 

.3 

.78 

.22 

.22 

.44 

1.1 

.4 

.86 

.14 

.14 

.28 

.7 

.5 

.9 

.10 

.10 

.20 

.5 

.6 

.95 

.05 

.05 

.10 

.25 

.8 

.98 

.02 

.02 

.04 

.1 

1.0 

1.0 

0 

0 

0 

0 


Table D-3. Relative transient response expected from the Thematic Mapper 
The curves from Table D-3 are chose of the right side of Figure 11. 
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BliVS IN FIELD SIZE ESTIMATION 


The condition for no pixel loss is that the decision brightness for field 
boundary estimation should be midway between the brightnesses of the two 
fields on either side of the boundary. The sensitivity of the setting as it 
aftects the apparent field size may be estimated as follows. (.Figure D-3) 
Define n^ is the length in pixels of the short side of a rectangular 

field 

02 is the length in pixels of the long side of a rectangular 
field 

r * n 2 /d]^ is the field shape ratio 

e is the distance in pixels of the loss in field size due to 
setting of the decision brightness 

T is the difference in brightness of the adjacent fields 

6' is the decision brightness setting, away from the brightness 

of the field being considered 

5 IS the decision brightness setting, scaled to units of T 
X is the Tj^q_qq in pixels 


The fractional area losa F is given by; 


rn2 


by ; 



2 In^Gj + n^d2 

4 

CM 

2 


2 

r 1 n. 

r 

"l 


For equal transient rise distance in both x and y, characteristic of the 
Thematic Mapper, ei “ 62 , so that the fractional area loss becomes 


F 


r + l|2e 
r iij 


1_ ( 2e 

r ' n 


Approximating the center portion of the transient rise by a straight line 
joining the 10% and 90% corner points (Figure D-3), 


JL /-X. 

T ^ 2 


and 


2e 2T 1 . 

■T" T"! *0 " ) 


giving for F; 




The fractional area loss is zero when 6= 1/2. The sensitivity of F to 
changes in the decision level 6 is given by 


d6 


2T[r ■F 1 
n]1 r 


at 6 


1 

2 


Figure D-4 shows the sensitivity (dF/d6 ) for two values of x (1.0 and 1.5) 
and various values of the shape ratio, as functions of n^, the number of 
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pixels in ttie short side oi the iield. The curves are generally hyperbolic 
due to the dominating intluence ot the l/n]^ term, with minor efciects due to 
the shape ratio. However, maintaining the minimum T is necessary; note that t 
is the transient distance due to all causes. 

Figure D-5 shows the fcractional area loss tor various parameters. Again, 
the l/n^ term predominates the shape of the curves, so that errors are 
severe at small field sisses. Unly at 6 «* 1/2 are the errors eliminated, 
tiiven chat * 1/2 will not be the general setting ot the decision level, 
minimization ot F is accomplished by minimizing x and by keeping 6 as close to 
1/2 as possible. ’Uie tield ratio is ot secondary importance, and in any event 
is not under the control of the sensor, Its effect may be estimated: 

2 T 1 

Recast the equation tor F, letting iS ) = A, 

1 1 A 

1 o 

F “ A + “ (A A^) 


Even tor small fields, A<0.2, so that A^ may be ignored, giving 


This is plotted in Figure D-b tor T « 1.5, with vertical scaling tor two 
values ot (0.3 and 0.4). All other things being equal, a change in shape 
from square (r » 1) to rectangular with r = 4 changes F by a ratio ot b/5. 
However, the actuat magniLuue ot F will be small except for very small fields 
and 5 deviating appreciably from 1/2. 

Note tliat tile distance 6 is Clie same as one tialt ot the class size ot the 
discussion on pixels near borders; i.e., 6 ®S/2. 

KEbEAROH NEEUEL) 

'Ihe research problem implied by the ^ = 1/2 condition is: 

• Automatic (at any speed, much less etticiently) blobbing is yet to be 

accomplished (Gupta and Wintz (1975) gave a start to this). The 

blobbing will in general produce fuzzy field boundaries. Continuity 
and closed boundaries must be assured. 

• Vertices must be established to aetine boundary segments. 

• For each spectral band, the correct intensity values ot each set ot 
field interior pixels must be established, after which the 6 = 1/2 
condition may be defined for each segment, 

• The best boundary position may now be found using 6 =1/2. Since 

the decision levels will not be the same as those to be used tor 

classification, they will not, in general, adequately characterize 
the ground cover material. 

• Therefore, reclassification of the field interior pixels must be done 
to identify the materials. But since the field interiors are 
nominally uniform, a small number of surrogate pixels may suffice. 

Reference: 

Gupta J.N. & Wintz P.A. 1975 A Boundary Finding Algorithm and Its 

Applications, IEEE Trans on Circuits and Systems Vol. CAS-22 #4, P. 351. 
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CLASS SIZE S SCALING SHOWN FOR |3=3 


Figure I)-l Construction for Estimating the Fraction of Area Included under 
the Gaussian Noise Curve. The area is that which is included within 
the class boundaries; these boundaries move with respect to the noise 
curve as pixels approach the field borders. 



Figure D-2 Transient Response Estimated for the Thematic Mapper is 
Essentially zero to Full Scale in ± one Pixel. ERIM, 1976. 
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EDGE POSITION 



FIELD 



6' IN REAL UNITS 
6 IN FRACTION OFT 


APPROXIMATE THE CENTER SLOPE AS THAT 
OF A STRAIGHT LINE JOINING THE t,o-90 
LIMITS, LET T BE THIS DISTANCE, 

IN PIXELS. 


D-3 Construction for Estimation of Loss of Field Size vs. d /T 




D-^ Fractional Area Loss (FAL) Sensitivity near the Optimum 

Decision Level Setting (6 ■ '?) as a Function of the Number 
of Pixels in the Short Side (Oj). 
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0.5 » 6*0.1 


6 -FRACTION OF 
TRANSIENT SIZE 



0.5 - \ 5 - 0.1 


f* 1 

r* 1.5 



Fip;ure t>-5 Fractional Area Loss (FAL) for Various Parameter Values 

as a Function of the Number of Pixels In the Short Side (n^. 


FAL FOR "l ~ ^ 
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r - SHAPE RATIO (LOGARITHMIC SCALE) 


Figure D-6 Influence of Shape Ratio on Field Area Lose 





APPENDIX E - TOTAL CLASSIFICATION EFFECTS 


The total probability of correct classification is given by 


p m — iw ( p n. + p.. n., + P vO . ) ■ p.f . + p.. fj. P uf 1 + P 1.^ t. 

rn| '*^11 * ib ib ‘^ob ob*^ 1 *^ib lb * ob ob * xb xb 

The fraction f of the total number of pixels in each zone are (Figure 12): 


Interior 


Inner Border 


Outer Border 


Exterior Border 


1 _ 

r ^ 


f . » 0 

1 


. ^ „ r + 1 1 12 1 

lb r nj^ r n| 

f - J- _ 2 


'lb nj ” rn^ 


f , » 2. 

ob 


r + 1 1 4 1 

r n^ r ^ 


= - 2,21 

[- ss 4 - — j 

ob r nj 


f « 2.— - --- -- + ^ 

xb r r n| 


hi ^ 

nj * 3,4 

£ 4 

nj^ » 3 

nj^ 2 4 
nj^ * 3 

All n 


These are plotted in Figure E-l for r =» 2, and the complete set of values is 
given in Table E-l. It can be seen from the Cable that the values for the 

average of all field sizes closely tracks the values for r»2. This ratio r*2 
will therefore be chosen to represent the world. Verification of the adequacy 
of this choice may be discerned from the plot of Figure E-2, in which 
variation of r has little effect on the probability, as indicated by the 
curves for r=l and r=8 being almost parallel, and close together. 

With r fixed at r“2, attention may now be focused on the probabilities, to 
evaluate the sensitivity to T/S, T , and 3 . The probabilities of correct 
classification for each zone are determined from Figure 10 for the set of 

parameters desired. These are listed in Table E-2 for various T/S, T , and 3 . 

To evaluate the effect of 3 , intermediate values of T , r, and T/S are 
selected, and p vs n| plotted for 3*3 and 5 (Figure E-3) . Decreasing 3 
has the expected effect of decreasing the probability of correct 

classification at any nj^. As the loss in p is about the same at all nj^. 
It is concluded that this loss is essentially independent of n]_. 3 " 3 

will be used to represent the global situation. 

Figure E-4 contains two plots, tor different T (1.5 and 2), of the 
probability vs n^ for families of T/S. The exact shape of the curves 
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1 

5 
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7 

.837 
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.736 
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7 
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5 

5 
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7 
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.02 
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7 
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Table E “ 
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depends on the precise shepe escimaced tor the cransienc rise in Figure 10. 
Note chac the T/S » i condition is the same as the 6 i/2 condition developed 

tor tne tieloi boundary decision; this condition ot no tieio size loss makes 
the probability ot correct decision independent ot r and n^. The curves 
ttom the two plots in Figure L-4 closely superpose tor high (e.g. S: 5) T/b, 
indicating that tor the range ot 1.5<T<2, the resultant probabilities are 
independent ot T . This occurs because the transiem. total shitt is so high 
that recognition~or-not changes over a short range. B'or intermediate Ce.g., 
2) values of T/b, border pixels contribute a varying but signiticant part of 
tne total; in this range> the transient discance T causes variation, and 
should be minimized to minimize the loss in accuracy tor small fields. 

MULTIPLE ACQUIbITIUNb 

An acquisition will be considered to be one data set; that is, one 
spectral banc. Again the basic model considered will be that m each banc 
the S will cause a probability < I of recognizing a within-class pixel, and 
for trend and orcer ot magnitude sensitivity analysis the covariance factors 
will be neglected. An output pixel grid defines the common matrix in which 
all images are analyzed and in which the "true" field position is defined. 

For each acquisition 1, 2, 3, ...q, each pixel in the output grid will 

have an associated probability P derived as above. The resultant total P for 
the set of acquisitions is the product P| P 2 P 3 ... Pq at each pixel. 
Denoting the mean P in the acquisitions at a given point as P^j, 

?l P2 .... Pq » Pjj 

This probability may be carried through the zone analysis as above. 

THE MISREGISTEHED CASE ICONGRUENT FIELDS) 


Figure E-5 is tne construction tor the case ot misregistration ot d 
pixels, assumed to be the same in botii directions. No field shape or size 
distortion or rotation are moueled; shape errors (for one-band analysis) can 
be estimated as the algebraic average of deviation from the ideal position ot 
the border pixels, ana used as d. In this way the possible waviness in field 
shapes caused (tor example) by the acan-line corrector errors can be accounted 
tor. The construction is given in Figure E-7. 

For calculation, the various zones are considered to be composed ot 

components as shown. The loss in accuracy will be reflected in the. lower 
probabilities in the components as they are seen in the misregistered 
position. Thus, the interior zone consists of a portion of the true field 

interior (a) and the component b, which is really in the true inner border 

transition zone. The remainder of the true field interior (i) is seen in the 

inner border. And so on. 
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Thus, Che probabxUties for Cha zones are; 

Incenor; ap£ + bp^j, « U-b)pi^ + bp^b 

Inner Border: ep^b + dp^b + fi"b)pi " fIB-d-b)p]_b+“Pob’''^Pi 

uucer Border: ep^b * ^Pxb * ^•3“'^^Pib " (Ob-f-d)pQb + ^Pxb'*'‘^Pib 

Excernal Border: gpj^b + hp^ + (k»f)pob * CXB-h-f)pxb+hpo+fpob 

^ Po “ 0 (OuCside) 

Lose CO Uucsioe: C2'“h)*d [(r+Dn^ + 

The cocal probabiLiCy la obtained by summing and collaccing tefnns: 


Interior 

' IP, - - p„,) 

Inner Border 

: IllPib + MPi - Pj,,) - 

Outer Border 

I ™Pob P 

External Border 

I I="Pxb 


P » Ipi + IBpib + OBpob + XBpxb 

U I ^ 1 

This is the probability of 
classitication, with ed^e 
effects, before misregis- 
tration. 


- Pol.’ 

■ ’’nl)' ■ ■ ■’xb' 

f'"..!. - "xh’ - "'"xb - 

-h(Pxb) 

I »«i 

This 18 the additional 
loss due to misregistracion. 


Thus, the net effect (Figure E-b) is that the area i , which was originally 
contributing with an ettecc pj{b> is no longer seen. In its place the area 
h, which contributes with an effect Pq* 0 is now covered, bince h ana i. are 
the same size, the net loss in probability is 

"•Pxb “ * i>«l P.xb* 

The area of h, expressed as a fraction of the true field size rn^, is: 

I’ = 

h ^ “l 

The probabilities p^b nre estimated from Figure 10 (Table E-4): 


T/S 

B 

T “ 1 

T » 1.5 

T = 2 


3 

.10 

.14 

.20 

1 

5 

.02 

.025 

.07 


7 

U 

.01 

.04 


3 

0 

0 

0 

2 

5 

0 

0 

0 


7 

0 

0 
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Table E-4 Probability of Correct Recognition of pixels in the 
First Row (External Border, XB) Outside of the True Field 
Boundary 



Combining the tractional area loss with the probabilities tor various 
combinations of the other parameters gives the reeults of Table E-3, the total 
misregistration loss. This is the loss over and above the base accuracy 
attained with finite field sizes but no misregistration. The data in Table 
E-3 are graphed in Figure 14. 

The use of the algebraic average tor small values of wavy distortion of 
one band is developed from Figure E -8 (the global-value case of Figure 10). 
In the region of tne field borders, each curve is more or less linear; for 
combined analysis of trends ana sensitivity analysis, pixels further Inside 
the border (than if registered) will have an increased probability, ana Chose 
nearer or outside will have decreased probability. The (very) approximately 
linear change in P allows the algebraic average to be used, provided that 
coRsnission and omission errors are considered to balance. 

For more chan one band analysis, because of the multiplication of the 
prooaoilities (per pixel) derived from each band, the lowest probability will 
predominate. In this case, the probability will drop as Che number of 
(raisregistered) bands increases, ana commission and omission no longer 
balance. Because of the unpredictable relative locations of the offending 
border pixels, the rms combination of errors from all sources is appropriate. 


MISREGISTRATION DUE TO SIZE AND RATIO (ASPECT) CHANGES 

Size and aspect ratio changes can come about from several causes such as 
scan velocity or altitude changes, and if uncompensated can cause additional 
misregistration errors. The sensitivity Co these can be analyzed using the 
constructions in Figure E-9. Progressive misregistration from a point of best 
registration will be caused by both causes (figure E-9a) ; the modeling of this 
effect will be that first size changes N ■ n'/n will cause a shift in points n 
to points n' both vertically and Horizontally, and that changes in aspect 
ratio will cause furthei/ shifts in the horizontal position of vertical borders 
by changing the field shape ratios by the tactor R ■ r‘/r. The resulting 
shifts are; 


A tVy “ (N-l)ny and Anj^ * (NR-l)rny 


In most analyses, cnis shift will be divided around the borders synraietricai ly 
as optimum field registration is accomplished (Figure E-9b). However, for 

analysis the construction of Figure E-9c is used. The sizes of the border 
position errors are (in pixels); 

“ (N“L)n]_ and 62 * (NK-Umj^ 


Two cases must be distinguished (using scan velocity as a surrogate cause): 

Case I: A slow scan decreases pixel spacing ana puts more pixels into a 
given field. When these are placed into the output grid, the field appears 
stretched. The field as defined by the other (correct) bands now covers only 
part of the stretched field. The classification tends to see only interior 
pixels, and the accuracy will increase, ultimately reaching the field-interior 
accuracy . 
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The UmiCinfe tield-interior claesification accuracy will be reached when 
Che tieid grows by tour pixels in each direction. This occurs when 

N " provided Chat R ^ . 

n n + 4 

Related to field sizesi the limits are: 

Field Short Side, ni, pixels 15 20 25 35 50 100 150 200 

Minimun Limit, R .985 .917 .931 .949 .936 .981 .987 .990 

liaximum Reached when N ■ 1.267 1.200 l.lbO l.llO 1.080 1.040 1.027 1.020 


It can be seen Chat for small fields, Che accuracy increases for quice a wide 
range of R and K, but that for large cields, the limiting accuracy is reached 
with relatively small R and N errors. But at large nj, the field total and 
tield-interior accuracies are close, so that small accuracy changes will be 
produced. 


Case II! A Cast scan has the opposite effect, causing the field to appear 
smaller and the analysis pixels defined by che other bands now include r.‘ jre 
exterior pixels. The classification accuracy will decrease. 


For fast scan, the smaller apparent field covers an area of 

r ^ ( ^ ^ 2 

1 , * m RN (Interior) 

rn j 

Fractional Areas: 


r « 2Nni + 2NRnjj* + 4 (External Border) 


xb 


rn 


1 


The total expected probability is 


^tot “ ^iPi * ^xb Pxb ' 

This can be approximated for Che assumed shape ratio r = 2 and N somewhere 

near 1: 


tot 


RN p. + l- 


1 + 2RN 


n 


1 


+ A 


xb 


bince the external border pixels are now inciuc 
tield, but with a low probability (bee Table E-4), Che fractional area RIn^ 
represents approximately the fraction of Che basic field-interior accuracy to 
oe expected. bince cne total size shrinkage (in pixels) is small for small 
onlyar larger n][ need be considered, ^ 


XllU 1. UUCU W JL 




"I- - . 

Further, tor RN'^ near I, che 


, and the 1/ term may be dropped. 

first term in the square brackets approaches 

. . . I « . 


cuLLucLi LUL i. ^ Uite ueLia in une squa 

3/n]_. Thus the total prooaoility may be approximated by 


^tot * RN^P£ + 



TT • 

^xb 
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Figure E-4 Variation of Probability for Different Values of T/S 
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Figure E-5 Component Areas for the Mlswglstered Case 
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Flpure E-7 General Case of Mstorted Fields 



Fipure E“8 Edge Pixel Classification Accuracy (From Flpure 10) 
and Linear Approximations 
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a. general case b. BEST REGISTRATION CASE 

(NO TRANSLATION DISPLACEMENT) 



c. ANALYSIS CONSTRUCTION 

Figure E-9 Construction for Estimating Misregistration Caused by 
Size and Aspect Ratio Errors. 
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APPENDIX F - RELIEF DISPLACEMENT 


The scenario used for relief displacement estimations is that for each 
image, the system processing will estimate a best fit plane using the ground 
control points, and correct the data to this. Deviations in the altitude of 
intermediate image points from this plane will cause them to be displaced in 
the image. Registration of subsequent images to the master will be done at a 
selected sparse array of control points, using either the ground control 
points or a set of relative control points. 


The construction for estimation of effects is given in Figure F-1. From 
similar triangles. 


approximated by: 



which, since h«H, can be 


The relief displacement p is seen to be dependent on the angle of view as 
represented by the distance d from the nadir to the point in question. 

Landsat does not return precisely to the same location in viewing a given 
WRS, so that two versions of the same scene may be imaged from two vantage 
points close together but not coincident. This is the normal stereo 
construction. These are indicated as points 1 and 2 in Figure F-1. Using 
this construction, the shift R in relief displacement for points at different 
altitudes is 



Note that this shift R is constant at all points in the image, and grows with 
increase in center separation D. (In stereo parlance, D is the stereo base.) 
Thus, even though the average ground location of each of a pair of images will 
be identified using the ephemeris and attitude data, ground altitude 
deviations will cause individual field boundaries to be displaced. For the 
range of center separations D to be expected, the amount of shift R in local 
points may be within the range (say, 0.2 to 1 pixel) which can cause edge, 
pixel effects. 

Thus, precision analysis and mapping will require the correction of image 
data location for all pixels. A candidate source of the required altitude 
data is the USGS surface altitude digital data, provided that it can be 
registered to Landsat to the required accuracy. 
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Flf?ure F-1 Construction for Relief Displacement (p) and 
Relief Displacement Shift (R) 



PLANE FOR LANDSAT-D 
(METERS) 


Figure F-2 Relief Displacement Shift R as a Function of Nadir 
Separation for Varying Altitude of the Ground above 
a Reference Plane 
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